
<110> Japan Science and Technology Agency 
Kuroda, Shunichi 
Tani zawa , Katsuyuki 
Okajima, Toshihide 
Kondo, Akihiko 
ueda, Nasakazu 
Seno, Masahura 

<120> THERAPEUTIC DRUG USING ANTIBODY-DISPLAYING HOLLOW PROTEIN 
NANOPARTICLES AND HOLLOW PROTEIN NANOPARTICLES 

<130> 12480-000067/US 

<150> 10/509,249 
<151> 2004-09-28 

<160> 245 

<170> Patentin version 3.4 

<210> 1 

<211> 27 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Synthesized Oligonucleotide 

<400> 1 

gctggtggtg gtggtggtgg tggtggt 27 



<210> 2 

<211> 39 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Synthesized Oligonucleotide 

<400> 2 

ctgagcctga gggctgcggc cgcctcccat gccttgtcg 39 



<210> 3 

<211> 36 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Synthesized Oligonucleotide 

<400> 3 

ggggacctcg gatccgcgag cttaccagtt ctcaca 36 



<210> 4 

<211> 36 

<212> DNA 

<213> artificial sequence 
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<220> 

<223> Synthesized Oligonucleotide 



sequence listing_ST25.txt 



<400> 4 

gaggtcgacc agctttaacg aacgcagaat tttcga 



36 



<210> 5 

<211> 33 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Synthesized Oligonucleotide 

<400> 5 

ggccgctgga gccacccgca gttcgaaaaa ggc 33 



<210> 6 

<211> 33 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Synthesized Oligonucleotide 



<210> 7 

<211> 29 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Synthesized Oligonucleotide 

<400> 7 

ggggtaccat gagatctttg ttgatcttg 29 



<210> 8 

<211> 28 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Synthesized Oligonucleotide 



<210> 9 

<211> 36 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Synthesized Oligonucleotide 



<400> 6 

ggccgccttt ttcgaactgc gggtggctcc age 



33 



<400> 8 

ggccgcggtt aaatgtatac ccaaagac 



28 
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sequence listing_ST25.txt 

<400> 9 

gggggcggcc gcgcgcaaca cgatgaagcc gtagac 36 



<210> 10 

<211> 36 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Synthesized Oligonucleotide 

<400> 10 

ggttgagata aaagagcttt tggcgcggcc gccttt 36 



<210> 11 

<211> 36 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Synthesized Oligonucleotide 

<400> 11 

cccgcggccg cccgaggaga cggtgactga ggtccc 36 



<210> 12 

<211> 36 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Synthesized Oligonucleotide 

<400> 12 

gggggcggcc gcgatgtgca gcttcaggag tcggga 36 



<210> 13 

<211> 30 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Synthesized Oligonucleotide 

<400> 13 

ggggcggccg ccttttattt ccaactttgt 



<210> 14 

<211> 30 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Synthesized Oligonucleotide 

<400> 14 

ggggcggccg ccttttattt ccaactttgt 



30 



30 
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sequence listing_ST25.txt 

<210> 15 
<211> 31 
<212> DNA 

<213> artificial sequence 
<220> 

<223> Synthesized Oligonucleotide 
<400> 15 

ccagttggac ggcggccgcc ctgcaccgaa c 



<210> 16 

<211> 31 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Synthesized Oligonucleotide 

<400> 16 

gttcggtgca gggcggccgc cgtccaactg g 



<210> 17 

<211> 34 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Synthesized Oligonucleotide 

<400> 17 

caatccagat tggggcggcc gccctgcacc gaac 



<210> 18 

<211> 34 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Synthesized Oligonucleotide 

<400> 18 

gttcggtgca gggcggccgc cccaatctgg attg 



<210> 19 

<211> 31 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Synthesized Oligonucleotide 

<400> 19 

ggtaggagcg ggcggccgcc ctgcaccgaa c 



<210> 20 
<211> 31 
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<212> DNA 

<213> artificial sequence 
<220> 

<223> Synthesized Oligonucleotide 
<400> 20 

gttcggtgca gggcggccgc ccgctcctac c 



<210> 21 

<211> 30 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Synthesized Oligonucleotide 

<400> 21 

cctcaggccg gcggccgccc tgcaccgaac 



<210> 22 

<211> 30 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Synthesized oligonucleotide 

<400> 22 

gttcggtgca gggcggccgc cctgaggatg 



<210> 23 

<211> 30 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Synthesized oligonucleotide 

<400> 23 

gttcggtgca gggcggccgc cctgaggatg 



<210> 24 

<211> 31 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Synthesized Oligonucleotide 

<400> 24 

cagagtgagg ggcggccgcc ctgcaccgaa c 



<210> 25 

<211> 31 

<212> DNA 

<213> artificial sequence 
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<220> 

<223> Synthesized Oligonucleotide 
<400> 25 

gttcggtgca gggcggccgc ccctcactct g 31 

<210> 26 

<211> 30 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Synthesized oligonucleotide 

<400> 26 

ggtaggagcg ggcggccgca gccctcaggc 30 

<210> 27 

<211> 30 

<212> DNA 

<213> artificial sequence 
<220> 

<223> Synthesized oligonucleotide 

<400> 27 

gcctgagggc tgcggccgcc cgctcctacc 30 

<210> 28 
<211> 10 
<212> PRT 

<213> artificial sequence 
<220> 

<223> artificial synthesized peptide sequence 
<400> 28 

ser Ala Trp Arg His Pro Gin Phe Gly Gly 
15 10 

<210> 29 
<211> 116 
<212> PRT 

<213> artificial sequence 
<220> 

<223> artificial synthesized peptide sequence 
<400> 29 

val Asp Asn Lys Phe Asn Lys Glu Gin Gin Asn Ala Phe Tyr Glu lie 
15 10 15 

Leu His Leu Pro Asn Leu Asn Glu Glu Gin Arg Asn Ala Phe lie Gin 
20 25 30 
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sequence listing_ST25.txt 
Ser Leu Lys Asp Asp Pro Ser Gin ser Ala Asn Leu Leu Ala Glu Ala 
35 40 45 

Lys Lys Leu Asn Asp Ala Gin Ala Pro Lys Val Asp Asn Lys Phe Asn 
50 55 60 

Lys Glu Gin Gin Asn Ala Phe Tyr Glu lie Leu His Leu Pro Asn Leu 
65 70 75 80 

Asn Glu Glu Gin Arg Asn Ala Phe lie Gin Ser Leu Lys Asp Asp Pro 
85 90 95 

Ser Gin Ser Ala Asn Leu Leu Ala Glu Ala Lys Lys Leu Asn Asp Ala 
100 105 110 

Gin Ala Pro Lys 
115 

<210> 30 

<211> 39 

<212> DNA 

<213> artificial synthesized peptide sequence 
<220> 

<223> test fused 

<400> 30 

gctgctgctg ctgctgctag aagaagaaga agaagaaga 39 

<210> 31 

<211> 39 

<212> DNA 

<213> Artificial Sequence Fused Peptide 
<220> 

<223> 21-153 + zz (serotype y) sequence 

<400> 31 

gctgctgctg ctgctgctag aagaagaaga agaagaaga 39 

<210> 32 

<211> 378 

<212> PRT 

<213> artificial sequence 
<220> 

<223> protein corresonding to 21-153 + ZZ (serotype y) sequence 

<400> 32 

Met Gly Thr Asn Leu Ser Val Pro Asn Pro Leu Gly Phe Phe Pro Asp 
1 5 10 15 



His Gin Leu Asp Gly Gly Arg Ala Gin His Asp Glu Ala Val Asp Asn 
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sequence listing_ST25.txt 
20 25 ^ 30 

Lys Phe Asn Lys Glu Gin Gin Asn Ala Phe Tyr Glu lie Leu His Leu 
35 40 45 

Pro Asn Leu Asn Glu Glu Gin Arg Asn Ala Phe lie Gin Ser Leu Lys 
50 55 60 

Asp Asp Pro Ser Gin ser Ala Asn Leu Leu Ala Glu Ala Lys Lys Leu 
65 70 75 80 

Asn Asp Ala Gin Ala Pro Lys val Asp Asn Lys Phe Asn Lys Glu Gin 
85 90 95 

Gin Asn Ala Phe Tyr Glu lie Leu His Leu Pro Asn Leu Asn Glu Glu 
100 105 110 

Gin Arg Asn Ala Phe lie Gin Ser Leu Lys Asp Asp Pro Ser Gin ser 
115 120 125 

Ala Asn Leu Leu Ala Glu Ala Lys Lys Leu Asn Asp Ala Gin Ala Pro 
130 135 140 

Lys Ala Ala Ala Pro Ala Pro Asn Met Glu Asn Thr Thr Ser Gly Phe 
145 150 155 160 

Leu Gly Pro Leu Leu val Leu Gin Ala Gly Phe Phe Leu Leu Thr Arg 
165 170 175 

lie Leu Thr lie Pro Gin Ser Leu Asp Ser Trp Trp Thr Ser Leu Asn 
180 185 190 

Phe Leu Gly Gly Ala Pro Thr Cys Pro Gly Gin Asn Ser Gin Ser Pro 
195 200 205 

Thr Ser Asn His Ser Pro Thr Ser Cys Pro Pro lie Cys Pro Gly Tyr 
210 215 220 

Arg Trp Met Cys Leu Arg Arg Phe lie lie Phe Leu Phe lie Leu Leu 
225 230 ~ 235 240 

Leu Cys Leu lie Phe Leu Leu val Leu Leu Asp Tyr Gin Gly Met Leu 
245 250 255 

Pro Val Cys Pro Leu Leu Pro Gly Thr ser Thr Thr ser Thr Gly Pro 
260 265 270 
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sequence listing_ST25.txt 
Cys Lys Thr cys Thr lie Pro Ala Gin Gly Thr Ser Met Phe Pro Ser 
275 280 285 

Cys Cys Cys Thr Lys Pro Ser Asp Gly Asn Cys Thr Cys lie Pro lie 
290 295 300 

Pro Ser Ser Trp Ala Phe Ala Arg Phe Leu Trp Glu Trp Ala Ser val 
305 310 " 315 320 

Arg Phe Ser Trp Leu Ser Leu Leu Val Pro Phe val Gin Trp Phe Val 
325 330 335 

Gly Leu Ser Pro Thr val Trp Leu Ser val lie Trp Met Met Trp Tyr 
340 345 350 

Trp Gly Pro Ser Leu Tyr Asn lie Leu Ser Pro Phe Leu Pro Leu Leu 
355 360 365 

Pro lie Phe Phe Cys Leu Trp val Tyr lie 
370 375 

<210> 33 

<211> 1134 

<212> DNA 

<213> artificial sequence 
<220> 

<223> 21-153 (Q129R) + zz (serotype y) sequence 

<400> 33 



atggggacga 


atctttctgt 


tcccaatcct 


ctgggattct 


ttcccgatca 


ccagttggac 


60 


ggcggccgcg 


cgcaacacga 


tgaagccgta 


gacaacaaat 


tcaacaaaga 


acaacaaaac 


120 


gcgttctatg 


agatcttaca 


tttacctaac 


ttaaacgaag 


aacaacgaaa 


cgccttcatc 


180 


caaagtttaa 


aagatgaccc 


aagccaaagc 


gctaaccttt 


tagcagaagc 


taaaaagcta 


240 


aatgatgctc 


aggcgccgaa 


agtagacaac 


aaattcaaca 


aagaacaaca 


aaacgcgttc 


300 


tatgagatct 


tacatttacc 


taacttaaac 


gaagaacaac 


gaaacgcctt 


catccaaagt 


360 


ttaaaagatg 


acccaagcca 


aagcgctaac 


cttttagcag 


aagctaaaaa 


gctaaatgat 


420 


gctcaggcgc 


cgaaagcggc 


cgcccctgca 


ccgaacatgg 


agaacacaac 


atcaggattc 


480 


ctaggacccc 


tgctcgtgtt 


acaggcgggg 


tttttcttgt 


tgacaagaat 


cctcacaata 


540 


ccacagagtc 


tagactcgtg 


gtggacttct 


ctcaattttc 


tagggggagc 


acccacgtgt 


600 


cctggccaaa 


attcgcagtc 


cccaacctcc 


aatcactcac 


caacctcttg 


tcctccaatt 


660 


tgtcctggct 


atcgctggat 


gtgtctgcgg 


cgttttatca 


tattcctctt 


catcctgctg 


720 


ctatgcctca 


tcttcttgtt 


ggttcttctg 


gactaccaag 
Page 


gtatgttgcc 
9 


cgtttgtcct 


780 



sequence listing_ST25.txt 
ctacttccag gaacatcaac caccagcacg gggccatgca agacctgcac gattcctgct 840 

cgaggaacct ctatgtttcc ctcttgttgc tgtacaaaac cttcggacgg aaactgcact 900 

tgtattccca tcccatcatc ctgggctttc gcaagattcc tatgggagtg ggcctcagtc 960 

cgtttctcct ggctcagttt actagtgcca tttgttcagt ggttcgtagg gctttccccc 1020 

actgtttggc tttcagttat atggatgatg tggtattggg ggccaagtct gtacaacatc 1080 

ttgagtccct ttttacctct attaccaatt ttcttttgtc tttgggtata catt 1134 

<210> 34 

<211> 378 

<212> PRT 

<213> artificial sequence 
<220> 

<223> Protein corresponding to 21-153 (Q129R) + zz (serotype y) 
sequence 

<400> 34 

Met Gly Thr Asn Leu Ser val Pro Asn Pro Leu Gly Phe Phe Pro Asp 
15 10 15 

His Gin Leu Asp Gly Gly Arg Ala Gin His Asp Glu Ala Val Asp Asn 
20 25 30 

Lys Phe Asn Lys Glu Gin Gin Asn Ala Phe Tyr Glu lie Leu His Leu 
35 40 45 

Pro Asn Leu Asn Glu Glu Gin Arg Asn Ala Phe lie Gin Ser Leu Lys 
50 55 60 

Asp Asp Pro Ser Gin Ser Ala Asn Leu Leu Ala Glu Ala Lys Lys Leu 
65 70 75 80 

Asn Asp Ala Gin Ala Pro Lys val Asp Asn Lys Phe Asn Lys Glu Gin 
85 90 95 

Gin Asn Ala Phe Tyr Glu lie Leu His Leu Pro Asn Leu Asn Glu Glu 
100 105 110 

Gin Arg Asn Ala Phe lie Gin Ser Leu Lys Asp Asp Pro Ser Gin Ser 
115 120 125 

Ala Asn Leu Leu Ala Glu Ala Lys Lys Leu Asn Asp Ala Gin Ala Pro 
130 135 140 

Lys Ala Ala Ala Pro Ala Pro Asn Met Glu Asn Thr Thr Ser Gly Phe 
145 150 155 160 
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sequence "listing_ST25.txt 

Leu Gly Pro Leu Leu val Leu Gin Ala Gly Phe Phe Leu Leu Thr Arg 
165 170 175 

lie Leu Thr lie Pro Gin ser Leu Asp Ser Trp Trp Thr Ser Leu Asn 
180 185 190 

Phe Leu Gly Gly Ala Pro Thr Cys Pro Gly Gin Asn Ser Gin Ser Pro 
195 200 205 

Thr Ser Asn His Ser Pro Thr Ser Cys Pro Pro lie Cys Pro Gly Tyr 
210 215 220 

Arg Trp Met Cys Leu Arg Arg Phe lie lie Phe Leu Phe lie Leu Leu 
225 230 235 240 

Leu Cys Leu lie Phe Leu Leu val Leu Leu Asp Tyr Gin Gly Met Leu 
245 250 255 

Pro val Cys Pro Leu Leu Pro Gly Thr Ser Thr Thr ser Thr Gly Pro 
260 265 270 

Cys Lys Thr Cys Thr lie Pro Ala Arg Gly Thr Ser Met Phe Pro Ser 
275 280 285 

Cys Cys Cys Thr Lys Pro Ser Asp Gly Asn Cys Thr cys lie Pro lie 
290 295 300 

Pro Ser Ser Trp Ala Phe Ala Arg Phe Leu Trp Glu Trp Ala Ser Val 
305 310 315 320 

Arg Phe Ser Trp Leu Ser Leu Leu val Pro Phe val Gin Trp Phe val 
325 330 335 

Gly Leu Ser Pro Thr Val Trp Leu Ser Val lie Trp Met Met Trp Tyr 
340 345 350 

Trp Gly Pro Ser Leu Tyr Asn lie Leu Ser Pro Phe Leu Pro Leu Leu 
355 360 365 

Pro lie Phe Phe Cys Leu Trp val Tyr lie 
370 375 

<210> 35 
<211> 1134 
<212> DNA 

<213> artificial sequence 
<220> 
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sequence listing_ST25.txt 
<223> 21-153 (G145R) + ZZ (serotype y) sequence 

<400> 35 



atggggacga 


atctttctgt 


tcccaatcct 


ctgggattct 


ttcccgatca 


ccagttggac 


60 


ggcggccgcg 


cgcaacacga 


tgaagccgta 


gacaacaaat 


tcaacaaaga 


acaacaaaac 


120 


gcgttctatg 


agatcttaca 


tttacctaac 


ttaaacgaag 


aacaacgaaa 


cgccttcatc 


180 


caaagtttaa 


aagatgaccc 


aagccaaagc 


gctaaccttt 


tagcagaagc 


taaaaagcta 


240 


aatgatgctc 


aggcgccgaa 


agtagacaac 


aaattcaaca 


aagaacaaca 


aaacgcgttc 


300 


tatgagatct 


tacatttacc 


taacttaaac 


gaagaacaac 


gaaacgcctt 


catccaaagt 


360 


ttaaaagatg 


acccaagcca 


aagcgctaac 


cttttagcag 


aagctaaaaa 


gctaaatgat 


420 


gctcaggcgc 


cgaaagcggc 


cgcccctgca 


ccgaacatgg 


agaacacaac 


atcaggattc 


480 


ctaggacccc 


tgctcgtgtt 


acaggcgggg 


tttttcttgt 


tgacaagaat 


cctcacaata 


540 


ccacagagtc 


tagactcgtg 


gtggacttct 


ctcaattttc 


tagggggagc 


acccacgtgt 


600 


cctggccaaa 


attcgcagtc 


cccaacctcc 


aatcactcac 


caacctcttg 


tcctccaatt 


660 


tgtcctggct 


atcgctggat 


gtgtctgcgg 


cgttttatca 


tattcctctt 


catcctgctg 


720 


ctatgcctca 


tcttcttgtt 


ggttcttctg 


gactaccaag 


gtatgttgcc 


cgtttgtcct 


780 


ctacttccaa 


aaacatcaac 


CaCCaacaca 


QQQCCAtQCa 
y y y v-a 


aaacctacac 

ay civ. v. l^luu 




840 


caaggaacct 


ctatgtttcc 


ctcttgttgc 


tgtacaaaac 


cttcggacag 


aaactgcact 


900 


tgtattccca 


tcccatcatc 


ctgggctttc 


gcaagattcc 


tatgggagtg 


ggcctcagtc 


960 


cgtttctcct 


ggctcagttt 


actagtgcca 


tttgttcagt 


ggttcgtagg 


gctttccccc 


1020 


actgtttggc 


tttcagttat 


atggatgatg 


tggtattggg 


ggccaagtct 


gtacaacatc 


1080 


ttgagtccct 


ttttacctct 


attaccaatt 


ttcttttgtc 


tttgggtata 


catt 


1134 



<210> 36 

<211> 378 

<212> PRT 

<213> artificial sequence 
<220> 

<223> Protein corresponding to 21-153 (G145R) + ZZ (serotype y) 
sequence 

<400> 36 

Met Gly Thr Asn Leu Ser Val Pro Asn Pro Leu Gly Phe Phe Pro Asp 
15 10 15 

His Gin Leu Asp Gly Gly Arg Ala Gin His Asp Glu Ala val Asp Asn 
20 25 30 



Lys Phe Asn Lys Glu Gin Gin Asn Ala Phe Tyr Glu lie Leu His Leu 
35 40 45 
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sequence listing_ST25.txt 

Pro Asn Leu Asn Glu Glu Gin Arg Asn Ala Phe lie Gin Ser Leu Lys 
50 55 60 

Asp Asp Pro Ser Gin Ser Ala Asn Leu Leu Ala Glu Ala Lys Lys Leu 
65 70 75 80 

Asn Asp Ala Gin Ala Pro Lys Val Asp Asn Lys Phe Asn Lys Glu Gin 
85 90 95 

Gin Asn Ala Phe Tyr Glu lie Leu His Leu Pro Asn Leu Asn Glu Glu 
100 105 110 

Gin Arg Asn Ala Phe lie Gin Ser Leu Lys Asp Asp Pro Ser Gin Ser 
115 120 125 

Ala Asn Leu Leu Ala Glu Ala Lys Lys Leu Asn Asp Ala Gin Ala Pro 
130 135 140 

Lys Ala Ala Ala Pro Ala Pro Asn Met Glu Asn Thr Thr Ser Gly Phe 
145 150 155 160 

Leu Gly Pro Leu Leu val Leu Gin Ala Gly Phe Phe Leu Leu Thr Arg 
165 170 175 

lie Leu Thr lie Pro Gin ser Leu Asp Ser Trp Trp Thr ser Leu Asn 
180 185 190 

Phe Leu Gly Gly Ala Pro Thr Cys Pro Gly Gin Asn Ser Gin ser Pro 
195 200 205 

Thr Ser Asn His Ser Pro Thr Ser Cys Pro Pro lie Cys Pro Gly Tyr 
210 215 220 

Arg Trp Met Cys Leu Arg Arg Phe lie lie Phe Leu Phe lie Leu Leu 
225 * 230 235 240 

Leu Cys Leu lie Phe Leu Leu val Leu Leu Asp Tyr Gin Gly Met Leu 
245 250 255 

Pro val Cys Pro Leu Leu Pro Gly Thr Ser Thr Thr ser Thr Gly Pro 
260 265 270 

Cys Lys Thr Cys Thr lie Pro Ala Gin Gly Thr Ser Met Phe Pro Ser 
275 280 285 

Cys Cys Cys Thr Lys Pro Ser Asp Arg Asn Cys Thr Cys lie Pro lie 
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sequence "listing_ST25.txt 
290 295 300 

Pro Ser Ser Trp Ala Phe Ala Arg Phe Leu Trp Glu Trp Ala Ser val 
305 310 ~ 315 320 

Arg Phe Ser Trp Leu ser Leu Leu val Pro Phe val Gin Trp Phe val 
325 330 335 

Gly Leu Ser Pro Thr Val Trp Leu Ser Val lie Trp Met Met Trp Tyr 
340 345 350 

Trp Gly Pro ser Leu Tyr Asn lie Leu Ser Pro Phe Leu Pro Leu Leu 
355 360 365 

Pro lie Phe Phe Cys Leu Trp val Tyr lie 
370 375 

<210> 37 

<211> 1134 

<212> DNA 

<213> artificial sequence 
<220> 

<223> 21-153 (Q129R/G145R) + ZZ-tag (serotype y) sequence 



<400> 37 
atggggacga 


atctttctgt 


tcccaatcct 


ctgggattct 


ttcccgatca 


ccagttggac 


60 


ggcggccgcg 


cgcaacacga 


tgaagccgta 


gacaacaaat 


tcaacaaaga 


acaacaaaac 


120 


gcgttctatg 


agatcttaca 


tttacctaac 


ttaaacgaag 


aacaacgaaa 


cgccttcatc 


180 


caaagtttaa 


aagatgaccc 


aagccaaagc 


gctaaccttt 


tagcagaagc 


taaaaagcta 


240 


aatgatgctc 


aggcgccgaa 


agtagacaac 


aaattcaaca 


aagaacaaca 


aaacgcgttc 


300 


tatgagatct 


tacatttacc 


taacttaaac 


gaagaacaac 


gaaacgcctt 


catccaaagt 


360 


ttaaaagatg 


acccaagcca 


aagcgctaac 


cttttagcag 


aagctaaaaa 


gctaaatgat 


420 


gctcaggcgc 


cgaaagcggc 


cgcccctgca 


ccgaacatgg 


agaacacaac 


atcaggattc 


480 


ctaggacccc 


tgctcgtgtt 


acaggcgggg 


tttttcttgt 


tgacaagaat 


cctcacaata 


540 


ccacagagtc 


tagactcgtg 


gtggacttct 


ctcaattttc 


tagggggagc 


acccacgtgt 


600 


cctggccaaa 


attcgcagtc 


cccaacctcc 


aatcactcac 


caacctcttg 


tcctccaatt 


660 


tgtcctggct 


atcgctggat 


gtgtctgcgg 


cgttttatca 


tattcctctt 


catcctgctg 


720 


ctatgcctca 


tcttcttgtt 


ggttcttctg 


gactaccaag 


gtatgttgcc 


cgtttgtcct 


780 


ctacttccag 


gaacatcaac 


caccagcacg 


gggccatgca 


agacctgcac 


gattcctgct 


840 


cgaggaacct 


ctatgtttcc 


ctcttgttgc 


tgtacaaaac 


cttcggacag 


aaactgcact 


900 


tgtattccca 


tcccatcatc 


ctgggctttc 


gcaagattcc 
page 


tatgggagtg 
14 


ggcctcagtc 


960 



sequence listing_ST25.txt 

cgtttctcct ggctcagttt actagtgcca tttgttcagt ggttcgtagg gctttccccc 1020 

actgtttggc tttcagttat atggatgatg tggtattggg ggccaagtct gtacaacatc 1080 

ttgagtccct ttttacctct attaccaatt ttcttttgtc tttgggtata catt 1134 

<210> 38 

<211> 378 

<212> PRT 

<213> artificial sequence 
<220> 

<223> Protein corresponding to 21-153 (Q129R/G145R) + zz-tag (serotype 
y) sequence 

<400> 38 

Met Gly Thr Asn Leu Ser val Pro Asn Pro Leu Gly Phe Phe Pro Asp 
15 10 15 

His Gin Leu Asp Gly Gly Arg Ala Gin His Asp Glu Ala val Asp Asn 
20 25 30 

Lys Phe Asn Lys Glu Gin Gin Asn Ala Phe Tyr Glu lie Leu His Leu 
35 40 45 

Pro Asn Leu Asn Glu Glu Gin Arg Asn Ala Phe lie Gin Ser Leu Lys 
50 55 60 

Asp Asp Pro Ser Gin Ser Ala Asn Leu Leu Ala Glu Ala Lys Lys Leu 
65 70 75 80 

Asn Asp Ala Gin Ala Pro Lys val Asp Asn Lys Phe Asn Lys Glu Gin 
85 90 95 

Gin Asn Ala Phe Tyr Glu He Leu His Leu Pro Asn Leu Asn Glu Glu 
100 105 110 

Gin Arg Asn Ala Phe lie Gin Ser Leu Lys Asp Asp Pro Ser Gin Ser 
115 120 125 

Ala Asn Leu Leu Ala Glu Ala Lys Lys Leu Asn Asp Ala Gin Ala Pro 
130 135 140 

Lys Ala Ala Ala Pro Ala Pro Asn Met Glu Asn Thr Thr Ser Gly Phe 
145 150 155 160 

Leu Gly Pro Leu Leu val Leu Gin Ala Gly Phe Phe Leu Leu Thr Arg 
165 170 175 
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lie Leu Thr lie Pro Gin Ser Leu Asp Ser Trp Trp Thr Ser Leu Asn 
180 185 190 

Phe Leu Gly Gly Ala Pro Thr Cys Pro Gly Gin Asn Ser Gin Ser Pro 
195 200 205 

Thr Ser Asn His Ser Pro Thr Ser Cys Pro Pro lie Cys Pro Gly Tyr 
210 215 220 

Arg Trp Met Cys Leu Arg Arg Phe lie lie Phe Leu Phe lie Leu Leu 
225 230 235 240 

Leu Cys Leu lie Phe Leu Leu Val Leu Leu Asp Tyr Gin Gly Met Leu 
245 250 255 

Pro val Cys Pro Leu Leu Pro Gly Thr Ser Thr Thr Ser Thr Gly Pro 
260 265 270 

Cys Lys Thr Cys Thr lie Pro Ala Arg Gly Thr Ser Met Phe Pro Ser 
275 280 ~ 285 

Cys Cys Cys Thr Lys Pro Ser Asp Arg Asn Cys Thr Cys lie Pro lie 
290 295 300 

Pro Ser Ser Trp Ala Phe Ala Arg Phe Leu Trp Glu Trp Ala Ser val 
305 310 315 320 

Arg Phe Ser Trp Leu Ser Leu Leu val Pro Phe Val Gin Trp Phe val 
325 330 335 

Gly Leu Ser Pro Thr val Trp Leu Ser Val lie Trp Met Met Trp Tyr 
340 345 350 

Trp Gly Pro Ser Leu Tyr Asn lie Leu Ser Pro Phe Leu Pro Leu Leu 
355 360 365 

Pro lie Phe Phe Cys Leu Trp val Tyr lie 
370 375 

<210> 39 
<211> 933 
<212> DNA 

<213> artificial sequence 
<220> 

<223> 21-153 -i- EGF (serotype y) sequence 
<400> 39 

atggggacga atctttctgt tcccaatcct ctgggattct ttcccgatca ccagttggac 
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ggcggccgca tgaactctga ttccgaatgc ccgctgtctc atgacggtta ctgcctgcat 120 

gatggcgtat gcatgtacat cgaagctctg gacaaatacg catgcaactg tgttgtaggt 180 

tacatcggcg aacgttgcca gtatcgcgac ctgaaatggt gggaactgcg taaggcggcc 240 

gcccctgcac cgaacatgga gaacacaaca tcaggattcc taggacccct gctcgtgtta 300 

caggcggggt ttttcttgtt gacaagaatc ctcacaatac cacagagtct agactcgtgg 360 

tggacttctc tcaattttct agggggagca cccacgtgtc ctggccaaaa ttcgcagtcc 420 

ccaacctcca atcactcacc aacctcttgt cctccaattt gtcctggcta tcgctggatg 480 

tgtctgcggc gttttatcat attcctcttc atcctgctgc tatgcctcat cttcttgttg 540 

gttcttctgg actaccaagg tatgttgccc gtttgtcctc tacttccagg aacatcaacc 600 

accagcacgg ggccatgcaa gacctgcacg attcctgctc aaggaacctc tatgtttccc 660 

tcttgttgct gtacaaaacc ttcggacgga aactgcactt gtattcccat cccatcatcc 720 

tgggctttcg caagattcct atgggagtgg gcctcagtcc gtttctcctg gctcagttta 780 

ctagtgccat ttgttcagtg gttcgtaggg ctttccccca ctgtttggct ttcagttata 840 

tggatgatgt ggtattgggg gccaagtctg tacaacatct tgagtccctt tttacctcta 900 

ttaccaattt tcttttgtct ttgggtatac att 933 

<210> 40 

<211> 311 

<212> PRT 

<213> artificial sequence 
<220> 

<223> Proteins corresponding to 21-153 + EGF (serotype y) sequence 

<400> 40 

Met Gly Thr Asn Leu Ser val Pro Asn Pro Leu Gly Phe Phe Pro Asp 



His Gin Leu Asp Gly Gly Arg Met Asn Ser Asp Ser Glu Cys Pro Leu 
20 " 25 30 

Ser His Asp Gly Tyr Cys Leu His Asp Gly val Cys Met Tyr lie Glu 

35 40 45 

Ala Leu Asp Lys Tyr Ala Cys Asn Cys val val Gly Tyr lie Gly Glu 
50 55 ' 60 

Arg Cys Gin Tyr Arg Asp Leu Lys Trp Trp Glu Leu Arg Lys Ala Ala 
65 70 75 80 

Ala Pro Ala Pro Asn Met Glu Asn Thr Thr Ser Gly Phe Leu Gly Pro 
85 90 95 
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Leu Leu val Leu Gin Ala Gly Phe Phe Leu Leu Thr Arg lie Leu Thr 
100 105 110 

lie Pro Gin Ser Leu Asp Ser Trp Trp Thr Ser Leu Asn Phe Leu Gly 
115 120 125 

Gly Ala Pro Thr Cys Pro Gly Gin Asn Ser Gin ser Pro Thr Ser Asn 
130 135 140 

His Ser Pro Thr Ser cys Pro Pro lie Cys Pro Gly Tyr Arg Trp Met 
145 150 155 160 

cys Leu Arg Arg Phe lie lie Phe Leu Phe lie Leu Leu Leu Cys Leu 
165 170 175 

lie Phe Leu Leu Val Leu Leu Asp Tyr Gin Gly Met Leu Pro Val Cys 
180 185 190 

Pro Leu Leu Pro Gly Thr Ser Thr Thr ser Thr Gly Pro Cys Lys Thr 
195 200 205 

Cys Thr lie Pro Ala Gin Gly Thr ser Met Phe Pro Ser Cys Cys Cys 
210 215 220 

Thr Lys Pro Ser Asp Gly Asn Cys Thr Cys lie Pro lie Pro Ser ser 
225 230 235 240 

Trp Ala Phe Ala Arg Phe Leu Trp Glu Trp Ala Ser Val Arg Phe Ser 
245 250 255 

Trp Leu Ser Leu Leu val Pro Phe val Gin Trp Phe val Gly Leu Ser 
260 265 270 

Pro Thr Val Trp Leu Ser Val lie Trp Met Met Trp Tyr Trp Gly Pro 
275 280 285 

Ser Leu Tyr Asn lie Leu Ser Pro Phe Leu Pro Leu Leu Pro lie Phe 
290 295 300 

Phe Cys Leu Trp val Tyr lie 
305 310 

<210> 41 
<211> 933 
<212> DNA 

<213> artificial sequence 
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<220> 

<223> 21-153 (Q129R) + EGF (serotype y) sequence 



<400> 41 
atggggacga 


atctttctgt 


tcccaatcct 


ctgggattct 


ttcccgatca 


ccagttggac 


60 


ggcggccgca 


tgaactctga 


ttccgaatgc 


ccgctgtctc 


atgacggtta 


ctgcctgcat 


120 


gatggcgtat 


gcatgtacat 


cgaagctctg 


gacaaatacg 


catgcaactg 


tgttgtaggt 


180 


tacatcggcg 


aacgttgcca 


gtatcgcgac 


ctgaaatggt 


gggaactgcg 


taaggcggcc 


240 


gcccctgcac 


cgaacatgga 


gaacacaaca 


tcaggattcc 


taggacccct 


gctcgtgtta 


300 


caggcggggt 


ttttcttgtt 


gacaagaatc 


ctcacaatac 


cacagagtct 


agactcgtgg 


360 


tggacttctc 


tcaattttct 


agggggagca 


cccacgtgtc 


ctggccaaaa 


ttcgcagtcc 


420 


ccaacctcca 


atcactcacc 


aacctcttgt 


cctccaattt 


gtcctggcta 


tcgctggatg 


480 


tgtctgcggc 


gttttatcat 


attcctcttc 


atcctgctgc 


tatgcctcat 


cttcttgttg 


540 


gttcttctgg 


actaccaagg 


tatgttgccc 


gtttgtcctc 


tacttccagg 


aacatcaacc 


600 


CH.LCLyi.Cll.yy 


y y LLd uy Lad 


y dLL uy Ld(_y 


dLLLLLytLL 


ydyyadLLXL 


tatntttrrr 
Id T.y L L LLLL 


DDU 


tcttgttgct 


gtacaaaacc 


ttcggacgga 


aactgcactt 


gtattcccat 


cccatcatcc 


720 


tgggctttcg 


caagattcct 


atgggagtgg 


gcctcagtcc 


gtttctcctg 


gctcagttta 


780 


ctagtgccat 


ttgttcagtg 


gttcgtaggg 


ctttccccca 


ctgtttggct 


ttcagttata 


840 


tggatgatgt 


ggtattgggg 


gccaagtctg 


tacaacatct 


tgagtccctt 


tttacctcta 


900 


ttaccaattt 


tcttttgtct 


ttgggtatac 


att 






933 



<210> 42 

<211> 311 

<212> PRT 

<213> artificial sequence 
<220> 

<223> Proteins corresponding to 21-153 (Q129R) + EGF (serotype y) 
sequence 

<400> 42 

Met Gly Thr Asn Leu ser val Pro Asn Pro Leu Gly Phe Phe Pro Asp 
15 10 15 

His Gin Leu Asp Gly Gly Arg Met Asn Ser Asp Ser Glu Cys Pro Leu 
20 25 30 

Ser His Asp Gly Tyr Cys Leu His Asp Gly val Cys Met Tyr lie Glu 
35 40 45 

Ala Leu Asp Lys Tyr Ala Cys Asn Cys val val Gly Tyr lie Gly Glu 
50 55 60 
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Arg Cys Gin Tyr Arg Asp Leu Lys Trp Trp Glu Leu Arg Lys Ala Ala 
65 70 75 80 

Ala Pro Ala Pro Asn Met Glu Asn Thr Thr Ser Gly Phe Leu Gly Pro 
85 90 95 

Leu Leu val Leu Gin Ala Gly Phe Phe Leu Leu Thr Arg lie Leu Thr 
100 105 110 

lie Pro Gin Ser Leu Asp Ser Trp Trp Thr Ser Leu Asn Phe Leu Gly 
115 120 125 

Gly Ala Pro Thr Cys Pro Gly Gin Asn Ser Gin Ser Pro Thr ser Asn 
130 135 140 

His Ser Pro Thr Ser Cys Pro Pro lie Cys Pro Gly Tyr Arg Trp Met 
145 150 155 * ~ 160 

Cys Leu Arg Arg Phe lie lie Phe Leu Phe lie Leu Leu Leu Cys Leu 
165 170 175 

lie Phe Leu Leu val Leu Leu Asp Tyr Gin Gly Met Leu Pro val Cys 
180 185 190 

Pro Leu Leu Pro Gly Thr Ser Thr Thr ser Thr Gly Pro Cys Lys Thr 
195 200 205 

Cys Thr lie Pro Ala Arg Gly Thr Ser Met Phe Pro Ser Cys Cys Cys 
210 215 220 

Thr Lys Pro Ser Asp Gly Asn cys Thr Cys lie Pro lie Pro Ser ser 
225 230 235 240 

Trp Ala Phe Ala Arg Phe Leu Trp Glu Trp Ala Ser val Arg Phe ser 
245 250 255 

Trp Leu Ser Leu Leu val Pro Phe val Gin Trp Phe val Gly Leu Ser 
260 265 270 

Pro Thr Val Trp Leu Ser Val lie Trp Met Met Trp Tyr Trp Gly Pro 
275 280 285 

Ser Leu Tyr Asn lie Leu Ser Pro Phe Leu Pro Leu Leu Pro lie Phe 
290 295 300 

Phe Cys Leu Trp val Tyr lie 
305 310 
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<210> 43 
<211> 933 
<212> DNA 

<213> artificial sequence 



<220> 

<223> 21-153 (G145R) + EGF (serotype y) sequence 



<400> 43 
atggggacga 


atctttctgt 


tcccaatcct 


ctgggattct 


ttcccgatca 


ccagttggac 


60 


ggcggccgca 


tgaactctga 


ttccgaatgc 


ccgctgtctc 


atgacggtta 


ctgcctgcat 


120 


gatggcgtat 


gcatgtacat 


cgaagctctg 


gacaaatacg 


catgcaactg 


tgttgtaggt 


180 


tacatcggcg 


aacgttgcca 


gtatcgcgac 


ctgaaatggt 


gggaactgcg 


taaggcggcc 


240 


gcccctgcac 


cgaacatgga 


gaacacaaca 


tcaggattcc 


taggacccct 


gctcgtgtta 


300 


caggcggggt 


ttttcttgtt 


gacaagaatc 


ctcacaatac 


cacagagtct 


agactcgtgg 


360 


tggacttctc 


tcaattttct 


agggggagca 


cccacgtgtc 


ctggccaaaa 


ttcgcagtcc 


420 


ccaacctcca 


atcactcacc 


aacctcttgt 


cctccaattt 


gtcctggcta 


tcgctggatg 


480 


tgtctgcggc 


gttttatcat 


attcctcttc 


atcctgctgc 


tatgcctcat 


cttcttgttg 


540 


gttcttctgg 


actaccaagg 


tatgttgccc 


gtttgtcctc 


tacttccagg 


aacatcaacc 


600 


accagcacgg 


ggccatgcaa 


gacctgcacg 


attcctgctc 


aaggaacctc 


tatgtttccc 


660 


tcttgttgct 


gtacaaaacc 


ttcggacaga 


aactgcactt 


gtattcccat 


cccatcatcc 


720 


tgggctttcg 


caagattcct 


atgggagtgg 


gcctcagtcc 


gtttctcctg 


gctcagttta 


780 


ctagtgccat 


ttgttcagtg 


gttcgtaggg 


ctttccccca 


ctgtttggct 


ttcagttata 


840 


tggatgatgt 


ggtattgggg 


gccaagtctg 


tacaacatct 


tgagtccctt 


tttacctcta 


900 


ttaccaattt 


tcttttgtct 


ttgggtatac 


att 






933 



<210> 44 

<211> 311 

<212> PRT 

<213> artificial sequence 
<220> 

<223> Proteins corresponding to 21-153 (G145R) + EGF (serotype y) 
sequence 

<400> 44 

Met Gly Thr Asn Leu Ser Val Pro Asn Pro Leu Gly Phe Phe Pro Asp 
15 10 15 

His Gin Leu Asp Gly Gly Arg Met Asn Ser Asp Ser Glu Cys Pro Leu 
20 25 30 
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Ser His Asp Gly Tyr Cys Leu His Asp Gly Val Cys Met Tyr lie Glu 
35 40 45 

Ala Leu Asp Lys Tyr Ala Cys Asn Cys val val Gly Tyr lie Gly Glu 
50 55 60 

Arg Cys Gin Tyr Arg Asp Leu Lys Trp Trp Glu Leu Arg Lys Ala Ala 
65 70 75 " 80 

Ala Pro Ala Pro Asn Met Glu Asn Thr Thr Ser Gly Phe Leu Gly Pro 
85 90 95 

Leu Leu val Leu Gin Ala Gly Phe Phe Leu Leu Thr Arg lie Leu Thr 
100 105 110 

lie Pro Gin Ser Leu Asp Ser Trp Trp Thr Ser Leu Asn Phe Leu Gly 
115 120 125 

Gly Ala Pro Thr Cys Pro Gly Gin Asn Ser Gin Ser Pro Thr Ser Asn 
130 135 140 

His Ser Pro Thr Ser Cys Pro Pro lie Cys Pro Gly Tyr Arg Trp Met 
145 150 155 ~ 160 

Cys Leu Arg Arg Phe lie lie Phe Leu Phe lie Leu Leu Leu Cys Leu 
165 170 175 

lie Phe Leu Leu val Leu Leu Asp Tyr Gin Gly Met Leu Pro val Cys 
180 185 190 

Pro Leu Leu Pro Gly Thr Ser Thr Thr Ser Thr Gly Pro Cys Lys Thr 
195 200 205 

Cys Thr lie Pro Ala Gin Gly Thr Ser Met Phe Pro Ser Cys Cys Cys 
210 215 220 

Thr Lys Pro Ser Asp Arg Asn Cys Thr Cys lie Pro lie Pro Ser Ser 
225 230 235 240 

Trp Ala Phe Ala Arg Phe Leu Trp Glu Trp Ala Ser Val Arg Phe Ser 
245 250 255 

Trp Leu Ser Leu Leu val Pro Phe val Gin Trp Phe Val Gly Leu Ser 
260 265 270 

Pro Thr val Trp Leu Ser Val lie Trp Met Met Trp Tyr Trp Gly Pro 
275 280 285 
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Ser Leu Tyr Asn lie Leu Ser Pro Phe Leu Pro Leu Leu Pro lie Phe 
290 295 300 

Phe Cys Leu Trp val Tyr lie 
305 310 

<210> 45 

<211> 933 

<212> DNA 

<213> artificial sequence 
<220> 

<223> 21-153 (Q129R/G145R) + EGF (serotype y) sequence 

<400> 45 



atggggacga atctttctgt 


tcccaatcct 


ctgggattct 


ttcccgatca 


ccagttggac 


60 


ggcggccgca 


tgaactctga 


ttccgaatgc 


ccgctgtctc 


atgacggtta 


ctgcctgcat 


120 


gatggcgtat 


gcatgtacat 


cgaagctctg 


gacaaatacg 


catgcaactg 


tgttgtaggt 


180 


tacatcggcg 


aacgttgcca 


gtatcgcgac 


ctgaaatggt 


gggaactgcg 


taaggcggcc 


240 


gcccctgcac 


cgaacatgga 


gaacacaaca 


tcaggattcc 


taggacccct 


gctcgtgtta 


300 


caggcggggt 


ttttcttgtt 


gacaagaatc 


ctcacaatac 


cacagagtct 


agactcgtgg 


360 


tggacttctc 


tcaattttct 


agggggagca 


cccacgtgtc 


ctggccaaaa 


ttcgcagtcc 


420 


ccaacctcca 


atcactcacc 


aacctcttgt 


cctccaattt 


gtcctggcta 


tcgctggatg 


480 


tgtctgcggc 


gttttatcat 


attcctcttc 


atcctgctgc 


tatgcctcat 


cttcttgttg 


540 


gttcttctgg 


actaccaagg 


tatgttgccc 


gtttgtcctc 


tacttccagg 


aacatcaacc 


600 


accagcacgg 


ggccatgcaa 


gacctgcacg 


attcctgctc 


gaggaacctc 


tatgtttccc 


660 


tcttgttgct 


gtacaaaacc 


ttcggacaga 


aactgcactt 


gtattcccat 


cccatcatcc 


720 


tgggctttcg 


caagattcct 


atgggagtgg 


gcctcagtcc 


gtttctcctg 


gctcagttta 


780 


ctagtgccat 


ttgttcagtg 


gttcgtaggg 


ctttccccca 


ctgtttggct 


ttcagttata 


840 


tggatgatgt 


ggtattgggg 


gccaagtctg 


tacaacatct 


tgagtccctt 


tttacctcta 


900 


ttaccaattt 


tcttttgtct 


ttgggtatac 


att 






933 



<210> 46 

<211> 311 

<212> PRT 

<213> artificial sequence 
<220> 

<223> Proteins corresponding to 21-153 (Q129R/G145R) + EGF (serotype y) 
sequence 

<400> 46 

Met Gly Thr Asn Leu Ser Val Pro Asn Pro Leu Gly Phe Phe Pro Asp 
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1 5 10 15 

His Gin Leu Asp Gly Gly Arg Met Asn Ser Asp Ser Glu Cys Pro Leu 
20 25 30 

Ser His Asp Gly Tyr Cys Leu His Asp Gly val Cys Met Tyr lie Glu 
35 40 45 

Ala Leu Asp Lys Tyr Ala Cys Asn Cys Val Val Gly Tyr lie Gly Glu 
50 55 60 

Arg cys Gin Tyr Arg Asp Leu Lys Trp Trp Glu Leu Arg Lys Ala Ala 
65 70 75 80 

Ala Pro Ala Pro Asn Met Glu Asn Thr Thr ser Gly Phe Leu Gly Pro 
85 90 95 

Leu Leu val Leu Gin Ala Gly Phe Phe Leu Leu Thr Arg lie Leu Thr 
100 105 110 

lie Pro Gin ser Leu Asp Ser Trp Trp Thr Ser Leu Asn Phe Leu Gly 
115 120 125 

Gly Ala Pro Thr Cys Pro Gly Gin Asn Ser Gin Ser Pro Thr ser Asn 
130 135 140 

His ser Pro Thr ser Cys Pro Pro lie Cys Pro Gly Tyr Arg Trp Met 
145 150 155 ~ 160 

Cys Leu Arg Arg Phe lie lie Phe Leu Phe lie Leu Leu Leu Cys Leu 
165 170 175 

lie Phe Leu Leu val Leu Leu Asp Tyr Gin Gly Met Leu Pro val Cys 
180 185 190 

Pro Leu Leu Pro Gly Thr Ser Thr Thr ser Thr Gly Pro Cys Lys Thr 
195 200 205 

Cys Thr lie Pro Ala Arg Gly Thr ser Met Phe Pro Ser Cys Cys Cys 
210 215 220 

Thr Lys Pro Ser Asp Arg Asn Cys Thr Cys He Pro He Pro Ser Ser 
225 230 235 240 

Trp Ala Phe Ala Arg Phe Leu Trp Glu Trp Ala Ser Val Arg Phe Ser 
245 250 255 
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Trp Leu Ser Leu Leu val Pro Phe val Gin Trp Plie val Gly Leu Ser 
260 265 270 



pro Thr val Trp Leu Ser val lie Trp Met Met Trp Tyr Trp Gly Pro 
275 280 285 



Ser Leu Tyr Asn lie Leu Ser Pro Phe Leu Pro Leu Leu Pro He Phe 
290 295 300 



Phe cys Leu Trp Val Tyr lie 
305 310 



<210> 47 

<211> 792 

<212> DNA 

<213> artificial sequence 



<220> 

<223> 21-153 + STR-tag (streptavidin) (serotype y) sequence 



<400> 47 
atggggacga 


atctttctgt 


tcccaatcct 


ctgggattct 


ttcccgatca 


ccagttggac 


60 


ggcggccgct 


ggagccaccc 


gcagttcgaa 


aaagcggccg 


cccctgcacc 


gaacatggag 


120 


aacacaacat 


caggattcct 


aggacccctg 


ctcgtgttac 


aggcggggtt 


tttcttgttg 


180 


acaagaatcc 


tcacaatacc 


acagagtcta 


gactcgtggt 


ggacttctct 


caattttcta 


240 


gggggagcac 


ccacgtgtcc 


tggccaaaat 


tcgcagtccc 


caacctccaa 


tcactcacca 


300 


acctcttgtc 


ctccaatttg 


tcctggctat 


cgctggatgt 


gtctgcggcg 


ttttatcata 


360 


ttcctcttca 


tcctgctgct 


atgcctcatc 


ttcttgttgg 


ttcttctgga 


ctaccaaggt 


420 


atgttgcccg 


tttgtcctct 


acttccagga 


acatcaacca 


ccagcacggg 


gccatgcaag 


480 


acctgcacga 


ttcctgctca 


aggaacctct 


atgtttccct 


cttgttgctg 


tacaaaacct 


540 


tcggacggaa 


actgcacttg 


tattcccatc 


ccatcatcct 


gggctttcgc 


aagattccta 


600 


tgggagtggg 


cctcagtccg 


tttctcctgg 


ctcagtttac 


tagtgccatt 


tgttcagtgg 


660 


ttcgtagggc 


tttcccccac 


tgtttggctt 


tcagttatat 


ggatgatgtg 


gtattggggg 


720 


ccaagtctgt 


acaacatctt 


gagtcccttt 


ttacctctat 


taccaatttt 


cttttgtctt 


780 


tgggtataca 


tt 










792 



<210> 48 
<211> 264 
<212> PRT 

<213> artificial sequence 
<220> 

<223> Proteins corresponding to 21-153 + STR-tag (streptavidin) 
(serotype y) sequence 
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<400> 48 

Met Gly Thr Asn Leu Ser val Pro Asn Pro Leu Gly Phe Phe Pro Asp 
1 5 10 15 

His Gin Leu Asp Gly Gly Arg Trp Ser His Pro Gin Phe Glu Lys Ala 
20 25 30 

Ala Ala Pro Ala Pro Asn Met Glu Asn Thr Thr Ser Gly Phe Leu Gly 
35 40 45 

Pro Leu Leu val Leu Gin Ala Gly Phe Phe Leu Leu Thr Arg lie Leu 
50 55 60 

Thr lie Pro Gin Ser Leu Asp Ser Trp Trp Thr Ser Leu Asn Phe Leu 
65 70 75 80 

Gly Gly Ala Pro Thr Cys Pro Gly Gin Asn Ser Gin Ser Pro Thr Ser 
85 90 95 

Asn His Ser Pro Thr Ser Cys Pro Pro lie Cys Pro Gly Tyr Arg Trp 
100 105 110 

Met Cys Leu Arg Arg Phe lie lie Phe Leu Phe lie Leu Leu Leu Cys 
115 ~ 120 125 

Leu lie Phe Leu Leu val Leu Leu Asp Tyr Gin Gly Met Leu Pro val 
130 135 140 

Cys Pro Leu Leu Pro Gly Thr Ser Thr Thr ser Thr Gly Pro Cys Lys 
145 150 155 * 160 

Thr Cys Thr lie Pro Ala Gin Gly Thr Ser Met Phe Pro Ser Cys Cys 
165 170 175 

cys Thr Lys Pro Ser Asp Gly Asn cys Thr Cys lie Pro lie Pro Ser 
180 185 190 

Ser Trp Ala Phe Ala Arg Phe Leu Trp Glu Trp Ala Ser Val Arg Phe 
195 200 205 

Ser Trp Leu Ser Leu Leu val Pro Phe Val Gin Trp Phe val Gly Leu 
210 215 220 

Ser Pro Thr Val Trp Leu Ser Val lie Trp Met Met Trp Tyr Trp Gly 
225 230 235 240 

Pro Ser Leu Tyr Asn lie Leu Ser Pro Phe Leu Pro Leu Leu Pro lie 
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245 250 255 



Phe Phe Cys Leu Trp val Tyr lie 
260 



<210> 49 

<211> 792 

<212> DNA 

<213> artificial sequence 



<220> 

<223> 21-153 (Q129R) + STR (serotype y) sequence 



<400> 49 
atggggacga 


atctttctgt 


tcccaatcct 


ctgggattct 


ttcccgatca 


ccagttggac 


60 


ggcggccgct 


ggagccaccc 


gcagttcgaa 


aaagcggccg 


cccctgcacc 


gaacatggag 


120 




rannattrrt 


annarrrrtn 


rtrntnttar 


aggcggggri: 


LtT.CT.T.gT.Lg 


-LOU 


acaagaatcc 


tcacaatacc 


acagagtcta 


gactcgtggt 


ggacttctct 


caattttcta 


240 


gggggagcac 


ccacgtgtcc 


tggccaaaat 


tcgcagtccc 


caacctccaa 


tcactcacca 


300 


acctcttgtc 


ctccaatttg 


tcctggctat 


cgctggatgt 


gtctgcggcg 


ttttatcata 


360 


ttcctcttca 


tcctgctgct 


atgcctcatc 


ttcttgttgg 


ttcttctgga 


ctaccaaggt 


420 


atgttgcccg 


tttgtcctct 


acttccagga 


acatcaacca 


ccagcacggg 


gccatgcaag 


480 


acctgcacga 


ttcctgctcg 


aggaacctct 


atgtttccct 


cttgttgctg 


tacaaaacct 


540 


tcggacggaa 


actgcacttg 


tattcccatc 


ccatcatcct 


gggctttcgc 


aagattccta 


600 


tgggagtggg 


cctcagtccg 


tttctcctgg 


ctcagtttac 


tagtgccatt 


tgttcagtgg 


660 


ttcgtagggc 


tttcccccac 


tgtttggctt 


tcagttatat 


ggatgatgtg 


gtattggggg 


720 


ccaagtctgt 


acaacatctt 


gagtcccttt 


ttacctctat 


taccaatttt 


cttttgtctt 


780 


tgggtataca 


tt 










792 



<210> 50 

<211> 264 

<212> PRT 

<213> artificial sequence 
<220> 

<223> Proteins corresponding to 21-153 (Q129R) + STR (serotype y) 
sequence 

<400> 50 

Met Gly Thr Asn Leu ser Val Pro Asn Pro Leu Gly Phe Phe Pro Asp 
15 10 15 



His Gin Leu Asp Gly Gly Arg Trp Ser His Pro Gin Phe Glu Lys Ala 
20 25 30 
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Ala Ala Pro Ala Pro Asn Met Glu Asn Thr Thr Ser Gly Phe Leu Gly 
35 40 45 

Pro Leu Leu val Leu Gin Ala Gly Phe Phe Leu Leu Thr Arg lie Leu 
50 55 60 

Thr lie Pro Gin Ser Leu Asp Ser Trp Trp Thr Ser Leu Asn Phe Leu 
65 70 75 80 

Gly Gly Ala Pro Thr Cys Pro Gly Gin Asn Ser Gin Ser Pro Thr Ser 
85 90 95 

Asn His Ser Pro Thr Ser Cys Pro Pro lie Cys Pro Gly Tyr Arg Trp 
100 105 110 

Met Cys Leu Arg Arg Phe lie lie Phe Leu Phe lie Leu Leu Leu Cys 
115 " 120 125 

Leu lie Phe Leu Leu val Leu Leu Asp Tyr Gin Gly Met Leu Pro val 
130 135 140 

Cys Pro Leu Leu Pro Gly Thr Ser Thr Thr Ser Thr Gly Pro Cys Lys 
145 150 155 160 

Thr Cys Thr He Pro Ala Arg Gly Thr Ser Met Phe Pro ser cys Cys 
165 170 175 

Cys Thr Lys Pro Ser Asp Gly Asn Cys Thr Cys lie Pro lie Pro Ser 
180 185 190 

Ser Trp Ala Phe Ala Arg Phe Leu Trp Glu Trp Ala Ser val Arg Phe 
195 " 200 205 

Ser Trp Leu Ser Leu Leu val Pro Phe val Gin Trp Phe val Gly Leu 
210 215 220 

Ser Pro Thr val Trp Leu Ser Val lie Trp Met Met Trp Tyr Trp Gly 
225 230 235 240 

Pro Ser Leu Tyr Asn lie Leu Ser Pro Phe Leu Pro Leu Leu Pro lie 
245 250 255 

Phe Phe Cys Leu Trp val Tyr lie 
260 

<210> 51 
<211> 792 
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<212> DNA 

<213> artificial sequence 
<220> 

<223> 21-153 (G145R) + STR (serotype y) sequence 



<400> 51 
atggggacga 


atctttctgt 


tcccaatcct 


ctgggattct 


ttcccgatca 


ccagttggac 


60 


ggcggccgct 


ggagccaccc 


gcagttcgaa 


aaagcggccg 


cccctgcacc 


gaacatggag 


120 


aacacaacat 


caggattcct 


aggacccctg 


ctcgtgttac 


aggcggggtt 


tttcttgttg 


180 


acaagaatcc 


tcacaatacc 


acagagtcta 


gactcgtggt 


ggacttctct 


caattttcta 


240 


gggggagcac 


ccacgtgtcc 


tggccaaaat 


tcgcagtccc 


caacctccaa 


tcactcacca 


300 


acctcttgtc 


ctccaatttg 


tcctggctat 


cgctggatgt 


gtctgcggcg 


ttttatcata 


360 


ttcctcttca 


tcctgctgct 


atgcctcatc 


ttcttgttgg 


ttcttctgga 


ctaccaaggt 


420 


atgttgcccg 


tttgtcctct 


acttccagga 


acatcaacca 


ccagcacggg 


gccatgcaag 


480 


acctgcacga 


ttcctgctca 


aggaacctct 


atgtttccct 


cttgttgctg 


tacaaaacct 


540 


tcggacagaa 


actgcacttg 


tattcccatc 


ccatcatcct 


gggctttcgc 


aagattccta 


600 


tgggagtggg 


cctcagtccg 


tttctcctgg 


ctcagtttac 


tagtgccatt 


tgttcagtgg 


660 


ttcgtagggc 


tttcccccac 


tgtttggctt 


tcagttatat 


ggatgatgtg 


gtattggggg 


720 


ccaagtctgt 


acaacatctt 


gagtcccttt 


ttacctctat 


taccaatttt 


cttttgtctt 


780 


tgggtataca 


tt 










792 



<210> 52 

<211> 264 

<212> PRT 

<213> artificial sequence 
<220> 

<223> Proteins corresponding to 21-153 (G145R) + STR (serotype y) 
sequence 

<400> 52 

Met Gly Thr Asn Leu Ser Val Pro Asn Pro Leu Gly Phe Phe Pro Asp 
15 10 15 



His Gin Leu Asp Gly Gly Arg Trp ser His Pro Gin Phe Glu Lys Ala 
20 25 30 



Ala Ala Pro Ala Pro Asn Met Glu Asn Thr Thr Ser Gly Phe Leu Gly 
35 40 45 



Pro Leu Leu val Leu Gin Ala Gly Phe Phe Leu Leu Thr Arg lie Leu 
50 55 60 
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Thr lie Pro Gin Ser Leu Asp Ser Trp Trp Thr Ser Leu Asn Phe Leu 
65 70 75 80 

Gly Gly Ala Pro Thr Cys Pro Gly Gin Asn Ser Gin Ser Pro Thr Ser 
85 90 95 

Asn His Ser Pro Thr Ser Cys Pro Pro lie Cys Pro Gly Tyr Arg Trp 
100 105 110 

Met Cys Leu Arg Arg Phe lie lie Phe Leu Phe lie Leu Leu Leu Cys 
115 " 120 125 

Leu lie Phe Leu Leu Val Leu Leu Asp Tyr Gin Gly Met Leu Pro Val 
130 135 140 

Cys Pro Leu Leu Pro Gly Thr Ser Thr Thr Ser Thr Gly Pro Cys Lys 
145 150 155 160 

Thr Cys Thr lie Pro Ala Gin Gly Thr Ser Met Phe Pro Ser Cys Cys 
165 170 175 

Cys Thr Lys Pro Ser Asp Arg Asn Cys Thr Cys lie Pro lie Pro Ser 
180 185 190 

Ser Trp Ala Phe Ala Arg Phe Leu Trp Glu Trp Ala Ser val Arg Phe 
195 200 205 

Ser Trp Leu Ser Leu Leu Val Pro Phe Val Gin Trp Phe val Gly Leu 
210 215 220 

Ser Pro Thr val Trp Leu Ser val lie Trp Met Met Trp Tyr Trp Gly 
225 230 235 240 

Pro Ser Leu Tyr Asn lie Leu Ser Pro Phe Leu Pro Leu Leu Pro lie 
245 250 255 

Phe Phe Cys Leu Trp Val Tyr lie 
260 

<210> 53 
<211> 792 
<212> DNA 

<213> artificial sequence 
<220> 

<223> 21-153 (Ql29R/Gl45R)/STR-tag (serotype y) sequence 
<400> 53 

atggggacga atctttctgt tcccaatcct ctgggattct ttcccgatca ccagttggac 
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ggagccaccc 


sequence listing_ST25.txt 
gcagttcgaa aaagcggccg cccctgcacc 


gaacatggag 


120 


aacacaacat 


caggattcct 


aggacccctg 


ctcgtgttac 


aggcggggtt 


tttcttgttg 


180 


acaagaatcc 


tcacaatacc 


acagagtcta 


gactcgtggt 


ggacttctct 


caattttcta 


240 


gggggagcac 


ccacgtgtcc 


tggccaaaat 


tcgcagtccc 


caacctccaa 


tcactcacca 


300 


acctcttgtc 


ctccaatttg 


tcctggctat 


cgctggatgt 


gtctgcggcg 


ttttatcata 


360 


ttcctcttca 


tcctgctgct 


atgcctcatc 


ttcttgttgg 


ttcttctgga 


ctaccaaggt 


420 


atgttgcccg 


tttgtcctct 


acttccagga 


acatcaacca 


ccagcacggg 


gccatgcaag 


480 


acctgcacga 


ttcctgctcg 


aggaacctct 


atgtttccct 


cttgttgctg 


tacaaaacct 


540 


tcggacagaa 


actgcacttg 


tattcccatc 


ccatcatcct 


gggctttcgc 


aagattccta 


600 


ugggagxggg 




tttctcctgg 


ctcagtttac 


tagtgccatt 


Ly l Lv_dy Lyy 


uw 


ttcgtagggc 


tttcccccac 


tgtttggctt 


tcagttatat 


ggatgatgtg 


gtattggggg 


720 


ccaagtctgt 


acaacatctt 


gagtcccttt 


ttacctctat 


taccaatttt 


cttttgtctt 


780 


tgggtataca 


tt 










792 



<210> 54 

<211> 264 

<212> PRT 

<213> artificial sequence 
<220> 

<223> Proteins corresponding to 21-153 (Ql29R/Gl45R)/STR-tag (serotype 
y) sequence 

<400> 54 

Met Gly Thr Asn Leu Ser val Pro Asn Pro Leu Gly Phe Phe Pro Asp 
1 5 10 15 

His Gin Leu Asp Gly Gly Arg Trp Ser His Pro Gin Phe Glu Lys Ala 
20 25 30 

Ala Ala Pro Ala Pro Asn Met Glu Asn Thr Thr Ser Gly Phe Leu Gly 
35 40 45 

Pro Leu Leu Val Leu Gin Ala Gly Phe Phe Leu Leu Thr Arg lie Leu 
50 55 " 60 

Thr lie Pro Gin Ser Leu Asp Ser Trp Trp Thr Ser Leu Asn Phe Leu 
65 70 75 80 

Gly Gly Ala Pro Thr Cys Pro Gly Gin Asn Ser Gin Ser Pro Thr Ser 
85 90 95 



Asn His Ser Pro Thr Ser Cys Pro Pro lie Cys Pro Gly Tyr Arg Trp 

Page 31 



I 



sequence listing_ST25.txt 
100 105 ~ 110 

Met Cys Leu Arg Arg Phe lie lie Phe Leu Phe lie Leu Leu Leu Cys 
115 ~ 120 125 

Leu lie Phe Leu Leu val Leu Leu Asp Tyr Gin Gly Met Leu Pro val 
130 135 140 

Cys Pro Leu Leu Pro Gly Thr Ser Thr Thr Ser Thr Gly Pro Cys Lys 
145 150 155 160 

Thr Cys Thr lie Pro Ala Arg Gly Thr Ser Met Phe Pro Ser cys Cys 
165 170 175 

Cys Thr Lys Pro Ser Asp Arg Asn Cys Thr Cys lie Pro lie Pro Ser 
180 185 190 

Ser Trp Ala Phe Ala Arg Phe Leu Trp Glu Trp Ala Ser Val Arg Phe 
195 200 205 

ser Trp Leu ser Leu Leu val Pro Phe val Gin Trp Phe val Gly Leu 
210 215 220 

Ser Pro Thr val Trp Leu Ser Val lie Trp Met Met Trp Tyr Trp Gly 
225 230 235 240 

pro ser Leu Tyr Asn lie Leu Ser Pro Phe Leu Pro Leu Leu Pro lie 
245 250 255 

Phe Phe Cys Leu Trp val Tyr lie 
260 

<210> 55 
<211> 1170 
<212> DNA 

<213> artificial sequence 
<220> 

<223> 33-153 + zz (serotype y) sequence 
<400> 55 

atggggacga atctttctgt tcccaatcct ctgggattct ttcccgatca ccagttggac 60 

cctgcgttcg gagccaactc aaacaatcca gattggggcg gccgcgcgca acacgatgaa 120 

gccgtagaca acaaattcaa caaagaacaa caaaacgcgt tctatgagat cttacattta 180 

cctaacttaa acgaagaaca acgaaacgcc ttcatccaaa gtttaaaaga tgacccaagc 240 

caaagcgcta accttttagc agaagctaaa aagctaaatg atgctcaggc gccgaaagta 300 

gacaacaaat tcaacaaaga acaacaaaac gcgttctatg agatcttaca tttacctaac 360 
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ttaaacgaag 


aacaacgaaa 


cgccttcatc 


caaagtttaa 


aagatgaccc 


aagccaaagc 


420 


gctaaccttt 


tagcagaagc 


taaaaagcta 


aatgatgctc 


aggcgccgaa 


agcggccgcc 


480 


cctgcaccga 


acatggagaa 


cacaacatca 


ggattcctag 


gacccctgct 


cgtgttacag 


540 


gcggggtttt 


tcttgttgac 


aagaatcctc 


acaataccac 


agagtctaga 


ctcgtggtgg 


600 


acttctctca 


attttctagg 


gggagcaccc 


acgtgtcctg 


gccaaaattc 


gcagtcccca 


660 


acctccaatc 


actcaccaac 


ctcttgtcct 


ccaatttgtc 


ctggctatcg 


ctggatgtgt 


720 


ctgcggcgtt 


ttatcatatt 


cctcttcatc 


ctgctgctat 


gcctcatctt 


cttgttggtt 


780 


cttctggact 


accaaggtat 


gttgcccgtt 


tgtcctctac 


ttccaggaac 


atcaaccacc 


840 


agcacggggc 


catgcaagac 


ctgcacgatt 


cctgctcaag 


gaacctctat 


gtttccctct 


900 


t* o 1" t" a c t* o t* a 






LyLciL u uy Lei 


ttrrratrrr 


atratrrtnn 


QfiO 


gctttcgcaa 


gattcctatg 


ggagtgggcc 


tcagtccgtt 


tctcctggct 


cagtttacta 


1020 


gtgccatttg 


ttcagtggtt 


cgtagggctt 


tcccccactg 


tttggctttc 


agttatatgg 


1080 


atgatgtggt 


attgggggcc 


aagtctgtac 


aacatcttga 


gtcccttttt 


acctctatta 


1140 


ccaattttct 


tttgtctttg 


ggtatacatt 
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<210> 56 
<211> 390 
<212> PRT 

<213> artificial sequence 
<220> 

<223> Proteins corresponding to 33-153 + ZZ (serotype y) sequence 
<400> 56 

Met Gly Thr Asn Leu Ser val Pro Asn Pro Leu Gly Phe Phe Pro Asp 
15 10 15 

His Gin Leu Asp Pro Ala Phe Gly Ala Asn Ser Asn Asn Pro Asp Trp 
20 25 30 

Gly Gly Arg Ala Gin His Asp Glu Ala val Asp Asn Lys Phe Asn Lys 
35 40 45 

Glu Gin Gin Asn Ala Phe Tyr Glu He Leu His Leu Pro Asn Leu Asn 
50 55 60 

Glu Glu Gin Arg Asn Ala Phe lie Gin Ser Leu Lys Asp Asp Pro Ser 
65 70 75 80 

Gin Ser Ala Asn Leu Leu Ala Glu Ala Lys Lys Leu Asn Asp Ala Gin 
85 90 95 
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Ala Pro Lys val Asp Asn Lys Phe Asn Lys Glu Gin Gin Asn Ala Phe 
100 105 110 

Tyr Glu lie Leu His Leu Pro Asn Leu Asn Glu Glu Gin Arg Asn Ala 
115 120 125 

Phe He Gin Ser Leu Lys Asp Asp Pro Ser Gin Ser Ala Asn Leu Leu 
130 135 140 

Ala Glu Ala Lys Lys Leu Asn Asp Ala Gin Ala Pro Lys Ala Ala Ala 
145 150 155 160 

Pro Ala Pro Asn Met Glu Asn Thr Thr Ser Gly Phe Leu Gly Pro Leu 
165 170 175 

Leu val Leu Gin Ala Gly Phe Phe Leu Leu Thr Arg lie Leu Thr lie 
180 185 ~ 190 

Pro Gin Ser Leu Asp Ser Trp Trp Thr Ser Leu Asn Phe Leu Gly Gly 
195 200 205 

Ala Pro Thr Cys Pro Gly Gin Asn Ser Gin Ser Pro Thr Ser Asn His 
210 215 220 

Ser Pro Thr ser Cys Pro Pro lie Cys Pro Gly Tyr Arg Trp Met Cys 
225 230 235 240 

Leu Arg Arg Phe lie lie Phe Leu Phe lie Leu Leu Leu Cys Leu lie 
245 250 255 

Phe Leu Leu Val Leu Leu Asp Tyr Gin Gly Met Leu Pro val Cys Pro 
260 265 270 

Leu Leu Pro Gly Thr Ser Thr Thr Ser Thr Gly Pro Cys Lys Thr Cys 
275 280 285 

Thr lie Pro Ala Gin Gly Thr ser Met Phe Pro Ser Cys Cys Cys Thr 
290 295 300 

Lys Pro Ser Asp Gly Asn Cys Thr Cys lie Pro lie Pro Ser Ser Trp 
305 310 315 320 

Ala Phe Ala Arg Phe Leu Trp Glu Trp Ala ser val Arg Phe ser Trp 
325 330 335 

Leu Ser Leu Leu Val Pro Phe val Gin Trp Phe val Gly Leu ser Pro 
340 345 350 
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Thr val Trp Leu Ser val lie Trp Met Met Trp Tyr Trp Gly Pro Ser 
355 360 365 

Leu Tyr Asn lie Leu Ser Pro Phe Leu Pro Leu Leu Pro lie Phe Phe 
370 375 380 

Cys Leu Trp val Tyr lie 
385 390 

<210> 57 

<211> 1170 

<212> DNA 

<213> artificial sequence 
<220> 

<223> 33-153 (Q129R) + ZZ (serotype y) sequence 



<400> 57 



atggggacga 


atctttctgt 


tcccaatcct 


ctgggattct 


ttcccgatca 


ccagttggac 


60 


cctgcgttcg 


gagccaactc 


aaacaatcca 


gattggggcg 


gccgcgcgca 


acacgatgaa 


120 


gccgtagaca 


acaaattcaa 


caaagaacaa 


caaaacgcgt 


tctatgagat 


cttacattta 


180 


cctaacttaa 


acgaagaaca 


acgaaacgcc 


ttcatccaaa 


gtttaaaaga 


tgacccaagc 


240 


caaagcgcta 


accttttagc 


agaagctaaa 


aagctaaatg 


atgctcaggc 


gecgaaagta 


300 


gacaacaaat 


tcaacaaaga 


acaacaaaac 


gcgttctatg 


agate ttaca 


tt tacc taac 




ttaaacgaag 


aacaacgaaa 


cgccttcatc 


caaagtttaa 


aagatgaccc 


aagecaaage 


420 


gctaaccttt 


tagcagaagc 


taaaaagcta 


aatgatgctc 


aggegecgaa 


agcggccgcc 


480 


cctgcaccga 


acatggagaa 


cacaacatca 


ggattcctag 


gacccctgct 


cgtgttacag 


540 


gcggggtttt 


tcttgttgac 


aagaatcctc 


acaataccac 


agagtctaga 


ctcgtggtgg 


600 


acttctctca 


attttctagg 


gggagcaccc 


acgtgtcctg 


gecaaaatte 


gcagtcccca 


660 


acctccaatc 


actcaccaac 


ctcttgtcct 


ccaatttgtc 


ctggctatcg 


ctggatgtgt 


720 


ctgcggcgtt 


ttatcatatt 


cctcttcatc 


ctgctgctat 


gcctcatctt 


cttgttggtt 


780 


cttctggact 


accaaggtat 


gttgcccgtt 


tgtcctctac 


ttccaggaac 


atcaaccacc 


840 


agcacggggc 


catgcaagac 


ctgcacgatt 


cctgctcgag 


gaacctctat 


gtttccctct 


900 


tgttgctgta 


caaaaccttc 


ggacggaaac 


tgcacttgta 


ttcccatccc 


atcatcctgg 


960 


gctttcgcaa 


gattcctatg 


ggagtgggcc 


tcagtccgtt 


tctcctggct 


cagtttacta 


1020 


gtgccatttg 


ttcagtggtt 


cgtagggctt 


tcccccactg 


tttggctttc 


agttatatgg 


1080 


atgatgtggt 


attgggggcc 


aagtctgtac 


aacatcttga 


gtcccttttt 


acctctatta 


1140 


ccaattttct 


tttgtctttg 


ggtatacatt 


Page 
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<210> 58 
<211> 390 
<212> PRT 

<213> artificial sequence 
<220> 

<223> Proteins corresponding to 33-153 (Q129R) + zz (serotype y) 
sequence 

<400> 58 

Met Gly Thr Asn Leu Ser Val Pro Asn Pro Leu Gly Phe Phe Pro Asp 
15 10 15 

His Gin Leu Asp Pro Ala Phe Gly Ala Asn Ser Asn Asn Pro Asp Trp 
20 25 30 

Gly Gly Arg Ala Gin His Asp Glu Ala Val Asp Asn Lys Phe Asn Lys 
35 40 45 

Glu Gin Gin Asn Ala Phe Tyr Glu lie Leu His Leu Pro Asn Leu Asn 
50 55 60 

Glu Glu Gin Arg Asn Ala Phe lie Gin Ser Leu Lys Asp Asp Pro Ser 
65 70 75 80 

Gin Ser Ala Asn Leu Leu Ala Glu Ala Lys Lys Leu Asn Asp Ala Gin 
85 90 95 

Ala Pro Lys Val Asp Asn Lys Phe Asn Lys Glu Gin Gin Asn Ala Phe 
100 105 110 

Tyr Glu lie Leu His Leu Pro Asn Leu Asn Glu Glu Gin Arg Asn Ala 
115 120 125 

Phe lie Gin Ser Leu Lys Asp Asp Pro Ser Gin Ser Ala Asn Leu Leu 
130 135 140 

Ala Glu Ala Lys Lys Leu Asn Asp Ala Gin Ala Pro Lys Ala Ala Ala 
145 150 155 160 

Pro Ala Pro Asn Met Glu Asn Thr Thr Ser Gly Phe Leu Gly Pro Leu 
165 170 175 

Leu Val Leu Gin Ala Gly Phe Phe Leu Leu Thr Arg lie Leu Thr lie 
180 185 ~ 190 

Pro Gin Ser Leu Asp Ser Trp Trp Thr Ser Leu Asn Phe Leu Gly Gly 
195 200 205 
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Ala Pro Thr Cys Pro Gly Gin Asn Ser Gin Ser Pro Thr Ser Asn His 
210 215 220 

Ser Pro Thr Ser Cys Pro Pro lie Cys Pro Gly Tyr Arg Trp Met Cys 
225 230 ' 235 ~ 240 

Leu Arg Arg Phe lie lie Phe Leu Phe lie Leu Leu Leu Cys Leu lie 
245 250 255 

Phe Leu Leu val Leu Leu Asp Tyr Gin Gly Met Leu Pro val Cys Pro 
260 265 * 270 

Leu Leu Pro Gly Thr ser Thr Thr ser Thr Gly Pro Cys Lys Thr Cys 
275 280 285 

Thr lie Pro Ala Arg Gly Thr Ser Met Phe Pro Ser Cys Cys Cys Thr 
290 ~ 295 300 

Lys Pro Ser Asp Gly Asn Cys Thr Cys lie Pro lie Pro Ser Ser Trp 
305 310 315 320 

Ala Phe Ala Arg Phe Leu Trp Glu Trp Ala Ser Val Arg Phe Ser Trp 
325 330 335 

Leu Ser Leu Leu Val Pro Phe Val Gin Trp Phe val Gly Leu Ser Pro 
340 345 350 

Thr Val Trp Leu Ser Val lie Trp Met Met Trp Tyr Trp Gly Pro Ser 
355 360 365 

Leu Tyr Asn lie Leu Ser Pro Phe Leu Pro Leu Leu Pro lie Phe Phe 
370 375 380 

Cys Leu Trp Val Tyr lie 
385 390 

<210> 59 
<211> 1170 
<212> DNA 

<213> artificial sequence 
<220> 

<223> 33-153 (G145R) + ZZ (serotype y) sequence 
<400> 59 

atggggacga atctttctgt tcccaatcct ctgggattct ttcccgatca ccagttggac 60 

cctgcgttcg gagccaactc aaacaatcca gattggggcg gccgcgcgca acacgatgaa 120 

gccgtagaca acaaattcaa caaagaacaa caaaacgcgt tctatgagat cttacattta 180 
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cctaacttaa 


acgaagaaca 


acgaaacgcc 


ttcatccaaa 


gtttaaaaga 


tgacccaagc 


240 


caaagcgcta 


accttttagc 


agaagctaaa 


aagctaaatg 


atgctcaggc 


gccgaaagta 


300 


gacaacaaat 


tcaacaaaga 


acaacaaaac 


gcgttctatg 


agatcttaca 


tttacctaac 


360 


ttaaacgaag 


aacaacgaaa 


cgccttcatc 


caaagtttaa 


aagatgaccc 


aagccaaagc 


420 


gctaaccttt 


tagcagaagc 


taaaaagcta 


aatgatgctc 


aggcgccgaa 


agcggccgcc 


480 


cctgcaccga 


acatggagaa 


cacaacatca 


ggattcctag 


gacccctgct 


cgtgttacag 


540 


gcggggtttt 


tcttgttgac 


aagaatcctc 


acaataccac 


agagtctaga 


ctcgtggtgg 


600 


acttctctca 


attttctagg 


gggagcaccc 


acgtgtcctg 


gccaaaattc 


gcagtcccca 


660 


acctccaatc 


actcaccaac 


ctcttgtcct 


ccaatttgtc 


ctggctatcg 


ctggatgtgt 


720 


ctgcggcgtt 


ttatcatatt 


cctcttcatc 


ctgctgctat 


gcctcatctt 


cttgttggtt 


780 


cttctggact 


accaaggtat 


gttgcccgtt 


tgtcctctac 


ttccaggaac 


atcaaccacc 


840 


agcacggggc 


catgcaagac 


ctgcacgatt 


cctgctcaag 


gaacctctat 


gtttccctct 


900 


tgttgctgta 


caaaaccttc 


ggacagaaac 


tgcacttgta 


ttcccatccc 


atcatcctgg 


960 






ggagtgggcc 


tcagtccgtt 


tctcctggct 


IwdyLLLdLLd 




gtgccatttg 


ttcagtggtt 


cgtagggctt 


tcccccactg 


tttggctttc 


agttatatgg 


1080 


atgatgtggt 


attgggggcc 


aagtctgtac 


aacatcttga 


gtcccttttt 


acctctatta 


1140 


ccaattttct 


tttgtctttg 


ggtatacatt 








1170 



<210> 60 

<211> 390 

<212> PRT 

<213> artificial sequence 
<220> 

<223> Proteins corresponding to 33-153 (G145R) + zz (serotype y) 
sequence 

<400> 60 

Met Gly Thr Asn Leu Ser val Pro Asn Pro Leu Gly Phe Phe Pro Asp 
15 10 15 

His Gln Leu Asp Pro Ala Phe Gly Ala Asn Ser Asn Asn Pro Asp Trp 
20 25 30 

Gly Gly Arg Ala Gln His Asp Glu Ala Val Asp Asn Lys Phe Asn Lys 
35 40 45 

Glu Gln Gln Asn Ala Phe Tyr Glu lie Leu His Leu Pro Asn Leu Asn 
50 55 60 
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Glu Glu Gin Arg Asn Ala Phe lie Gin Ser Leu Lys Asp Asp Pro Ser 
65 70 75 80 

Gin Ser Ala Asn Leu Leu Ala Glu Ala Lys Lys Leu Asn Asp Ala Gin 
85 90 95 

Ala Pro Lys val Asp Asn Lys Phe Asn Lys Glu Gin Gin Asn Ala Phe 
100 105 110 

Tyr Glu lie Leu His Leu Pro Asn Leu Asn Glu Glu Gin Arg Asn Ala 
115 120 125 

Phe lie Gin Ser Leu Lys Asp Asp Pro Ser Gin Ser Ala Asn Leu Leu 
130 135 140 

Ala Glu Ala Lys Lys Leu Asn Asp Ala Gin Ala Pro Lys Ala Ala Ala 
145 150 155 160 

Pro Ala Pro Asn Met Glu Asn Thr Thr Ser Gly Phe Leu Gly Pro Leu 
165 170 175 

Leu val Leu Gin Ala Gly Phe Phe Leu Leu Thr Arg lie Leu Thr lie 
180 185 " 190 

Pro Gin Ser Leu Asp Ser Trp Trp Thr Ser Leu Asn Phe Leu Gly Gly 
195 200 205 

Ala Pro Thr Cys Pro Gly Gin Asn Ser Gin Ser Pro Thr Ser Asn His 
210 * 215 220 

Ser Pro Thr Ser Cys Pro Pro lie Cys Pro Gly Tyr Arg Trp Met Cys 
225 230 235 ** 240 

Leu Arg Arg Phe lie lie Phe Leu Phe lie Leu Leu Leu Cys Leu lie 
245 250 255 

Phe Leu Leu val Leu Leu Asp Tyr Gin Gly Met Leu Pro val Cys Pro 
260 265 270 

Leu Leu Pro Gly Thr Ser Thr Thr Ser Thr Gly Pro Cys Lys Thr Cys 
275 280 285 

Thr lie Pro Ala Gin Gly Thr Ser Met Phe Pro Ser Cys Cys Cys Thr 
290 295 300 

Lys Pro Ser Asp Arg Asn Cys Thr Cys lie Pro lie Pro Ser Ser Trp 
305 310 315 320 
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Ala Phe Ala Arg Phe Leu Trp Glu Trp Ala Ser val Arg Phe Ser Trp 
325 330 335 

Leu Ser Leu Leu Val Pro Phe val Gin Trp Phe val Gly Leu Ser Pro 
340 345 350 

Thr Val Trp Leu Ser Val lie Trp Met Met Trp Tyr Trp Gly Pro Ser 



Leu Tyr Asn lie Leu Ser Pro Phe Leu Pro Leu Leu Pro lie Phe Phe 
370 375 380 

Cys Leu Trp Val Tyr lie 
385 390 

<210> 61 
<211> 1170 
<212> DNA 

<213> artificial sequence 
<220> 

<223> 33-153 (Q129R/G145R) + zz (serotype y) sequence 
<400> 61 

atggggacga atctttctgt tcccaatcct ctgggattct ttcccgatca ccagttggac 60 

cctgcgttcg gagccaactc aaacaatcca gattggggcg gccgcgcgca acacgatgaa 120 

gccgtagaca acaaattcaa caaagaacaa caaaacgcgt tctatgagat cttacattta 180 

cctaacttaa acgaagaaca acgaaacgcc ttcatccaaa gtttaaaaga tgacccaagc 240 

caaagcgcta accttttagc agaagctaaa aagctaaatg atgctcaggc gccgaaagta 300 

gacaacaaat tcaacaaaga acaacaaaac gcgttctatg agatcttaca tttacctaac 360 

ttaaacgaag aacaacgaaa cgccttcatc caaagtttaa aagatgaccc aagccaaagc 420 

gctaaccttt tagcagaagc taaaaagcta aatgatgctc aggcgccgaa agcggccgcc 480 

cctgcaccga acatggagaa cacaacatca ggattcctag gacccctgct cgtgttacag 540 

gcggggtttt tcttgttgac aagaatcctc acaataccac agagtctaga ctcgtggtgg 600 

acttctctca attttctagg gggagcaccc acgtgtcctg gccaaaattc gcagtcccca 660 

acctccaatc actcaccaac ctcttgtcct ccaatttgtc ctggctatcg ctggatgtgt 720 

ctgcggcgtt ttatcatatt cctcttcatc ctgctgctat gcctcatctt cttgttggtt 780 

cttctggact accaaggtat gttgcccgtt tgtcctctac ttccaggaac atcaaccacc 840 

agcacggggc catgcaagac ctgcacgatt cctgctcgag gaacctctat gtttccctct 900 

tgttgctgta caaaaccttc ggacagaaac tgcacttgta ttcccatccc atcatcctgg 960 

gctttcgcaa gattcctatg ggagtgggcc tcagtccgtt tctcctggct cagtttacta 1020 
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gtgccatttg ttcagtggtt cgtagggctt tcccccactg tttggctttc agttatatgg 1080 

atgatgtggt attgggggcc aagtctgtac aacatcttga gtcccttttt acctctatta 1140 

ccaattttct tttgtctttg ggtatacatt 1170 

<210> 62 

<211> 390 

<212> PRT 

<213> artificial sequence 
<220> 

<223> Proteins corresponding to 33-153 (Q129R/G145R) + zz (serotype y) 
sequence 

<400> 62 

Met Gly Thr Asn Leu Ser Val Pro Asn Pro Leu Gly Phe Phe Pro Asp 
15 10 15 

His Gin Leu Asp Pro Ala Phe Gly Ala Asn Ser Asn Asn Pro Asp Trp 
20 25 30 

Gly Gly Arg Ala Gin His Asp Glu Ala val Asp Asn Lys Phe Asn Lys 
35 40 45 

Glu Gin Gin Asn Ala Phe Tyr Glu lie Leu His Leu Pro Asn Leu Asn 
50 55 60 

Glu Glu Gin Arg Asn Ala Phe lie Gin Ser Leu Lys Asp Asp Pro Ser 
65 70 75 80 

Gin ser Ala Asn Leu Leu Ala Glu Ala Lys Lys Leu Asn Asp Ala Gin 
85 90 95 

Ala Pro Lys val Asp Asn Lys Phe Asn Lys Glu Gin Gin Asn Ala Phe 
100 105 110 

Tyr Glu lie Leu His Leu Pro Asn Leu Asn Glu Glu Gin Arg Asn Ala 
115 120 125 

Phe lie Gin Ser Leu Lys Asp Asp Pro Ser Gin Ser Ala Asn Leu Leu 
130 135 140 

Ala Glu Ala Lys Lys Leu Asn Asp Ala Gin Ala Pro Lys Ala Ala Ala 
145 150 155 160 

Pro Ala Pro Asn Met Glu Asn Thr Thr Ser Gly Phe Leu Gly Pro Leu 
165 170 175 

Page 41 



sequence listing_ST25.txt 
Leu val Leu Gin Ala Gly Phe Phe Leu Leu Thr Arg lie Leu Thr lie 
180 185 190 

Pro Gin Ser Leu Asp Ser Trp Trp Thr ser Leu Asn Phe Leu Gly Gly 
195 200 205 

Ala Pro Thr Cys Pro Gly Gin Asn Ser Gin Ser Pro Thr Ser Asn His 
210 215 220 

ser Pro Thr ser Cys Pro Pro lie Cys Pro Gly Tyr Arg Trp Met Cys 
225 230 235 " 240 

Leu Arg Arg Phe lie lie Phe Leu Phe lie Leu Leu Leu Cys Leu lie 
245 250 255 

Phe Leu Leu Val Leu Leu Asp Tyr Gin Gly Met Leu Pro Val Cys Pro 
260 265 270 

Leu Leu Pro Gly Thr Ser Thr Thr Ser Thr Gly Pro Cys Lys Thr Cys 
275 280 285 

Thr lie Pro Ala Arg Gly Thr ser Met Phe Pro ser Cys Cys Cys Thr 
290 & 295 300 

Lys Pro Ser Asp Arg Asn Cys Thr Cys lie Pro lie Pro Ser ser Trp 
305 310 315 320 

Ala Phe Ala Arg Phe Leu Trp Glu Trp Ala Ser val Arg Phe ser Trp 
325 330 335 

Leu ser Leu Leu Val Pro Phe Val Gin Trp Phe val Gly Leu ser Pro 
340 345 350 

Thr Val Trp Leu Ser val lie Trp Met Met Trp Tyr Trp Gly Pro Ser 
355 360 365 

Leu Tyr Asn He Leu Ser Pro Phe Leu Pro Leu Leu Pro lie Phe Phe 
370 375 380 

Cys Leu Trp Val Tyr lie 
385 390 

<210> 63 

<211> 969 

<212> DNA 

<213> artificial sequence 
<220> 

<223> 33-153 + EGF (serotype y) sequence 
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<400> 63 
atggggacga 


atctttctgt 


tcccaatcct 


ctgggattct 


ttcccgatca 


ccagttggac 


60 


cctgcgttcg 


gagccaactc 


aaacaatcca 


gattggggcg 


gccgcatgaa 


ctctgattcc 


120 


gaatgcccgc 


tgtctcatga 


cggttactgc 


ctgcatgatg 


gcgtatgcat 


gtacatcgaa 


180 


gctctggaca 


aatacgcatg 


caactgtgtt 


gtaggttaca 


tcggcgaacg 


ttgccagtat 


240 


cgcgacctga 


aatggtggga 


actgcgtaag 


gcggccgccc 


ctgcaccgaa 


catggagaac 


300 


acaacatcag 


gattcctagg 


acccctgctc 


gtgttacagg 


cggggttttt 


cttgttgaca 


360 


agaatcctca 


caataccaca 


gagtctagac 


tcgtggtgga 


cttctctcaa 


ttttctaggg 


420 


ggagcaccca 


cgtgtcctgg 


ccaaaattcg 


cagtccccaa 


cctccaatca 


ctcaccaacc 


480 


tcttgtcctc 


caatttgtcc 


tggctatcgc 


tggatgtgtc 


tgcggcgttt 


tatcatattc 


540 


ctcttcatcc 


tgctgctatg 


cctcatcttc 


ttgttggttc 


ttctggacta 


ccaaggtatg 


600 


ttgcccgttt 


gtcctctact 


tccaggaaca 


tcaaccacca 


gcacggggcc 


atgcaagacc 


660 


tgcacgattc 


ctgctcaagg 


aacctctatg 


tttccctctt 


gttgctgtac 


aaaaccttcg 


720 


yo.v_yycicici.v- l 


y v_cn~ l i»y l. a l 


L\_l_v_dLv~l_v_cl 


LLaLLL Ly y y 


rtttrnraan 
lll Li_y Lady 


d l lll La Ly y 




gagtgggcct 


cagtccgttt 


ctcctggctc 


agtttactag 


tgccatttgt 


tcagtggttc 


840 


gtagggcttt 


cccccactgt 


ttggctttca 


gttatatgga 


tgatgtggta 


ttgggggcca 


900 


agtctgtaca 


acatcttgag 


tcccttttta 


cctctattac 


caattttctt 


ttgtctttgg 


960 


gtatacatt 












969 



<210> 64 
<211> 323 
<212> PRT 

<213> artificial sequence 
<220> 

<223> Proteins corresponding to 33-153 + EGF (serotype y). sequence 
<400> 64 

Met Gly Thr Asn Leu Ser val Pro Asn Pro Leu Gly Phe Phe Pro Asp 
15 10 15 

His Gin Leu Asp Pro Ala Phe Gly Ala Asn Ser Asn Asn Pro Asp Trp 
20 25 30 

Gly Gly Arg Met Asn Ser Asp Ser Glu Cys Pro Leu Ser His Asp Gly 
35 40 45 

Tyr Cys Leu His Asp Gly Val Cys Met Tyr lie Glu Ala Leu Asp Lys 
50 55 60 
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Tyr Ala Cys Asn Cys val val Gly Tyr lie Gly Glu Arg Cys Gin Tyr 
65 70 * 75 80 

Arg Asp Leu Lys Trp Trp Glu Leu Arg Lys Ala Ala Ala Pro Ala Pro 
85 90 95 

Asn Met Glu Asn Thr Thr Ser Gly Phe Leu Gly Pro Leu Leu val Leu 
100 105 110 

Gin Ala Gly Phe Phe Leu Leu Thr Arg lie Leu Thr lie Pro Gin Ser 
115 120 125 

Leu Asp Ser Trp Trp Thr Ser Leu Asn Phe Leu Gly Gly Ala Pro Thr 
130 135 140 

Cys Pro Gly Gin Asn Ser Gin Ser Pro Thr Ser Asn His Ser Pro Thr 
145 150 155 160 

Ser Cys Pro Pro lie Cys Pro Gly Tyr Arg Trp Met Cys Leu Arg Arg 
165 170 175 

Phe lie lie Phe Leu Phe lie Leu Leu Leu Cys Leu lie Phe Leu Leu 
180 185 190 



val Leu Leu Asp Tyr Gin Gly Met Leu Pro val Cys Pro Leu Leu Pro 
195 200 205 

Gly Thr Ser Thr Thr Ser Thr Gly Pro Cys Lys Thr Cys Thr lie Pro 
210 215 220 

Ala Gin Gly Thr Ser Met Phe Pro ser Cys Cys Cys Thr Lys Pro Ser 
225 230 235 240 

Asp Gly Asn Cys Thr Cys lie Pro lie Pro Ser Ser Trp Ala Phe Ala 
245 250 255 

Arg Phe Leu Trp Glu Trp Ala Ser Val Arg Phe Ser Trp Leu Ser Leu 
260 265 270 

Leu Val Pro Phe Val Gin Trp Phe Val Gly Leu Ser Pro Thr Val Trp 
275 280 285 

Leu Ser Val lie Trp Met Met Trp Tyr Trp Gly Pro Ser Leu Tyr Asn 
290 295 300 

lie Leu Ser Pro Phe Leu Pro Leu Leu Pro lie Phe Phe Cys Leu Trp 
305 310 315 320 

Page 44 



sequence listing_ST25.txt 

val Tyr lie 

<210> 65 

<211> 969 

<212> DNA 

<213> artificial sequence 
<220> 

<223> 33-153 (Q129R) + EGF (serotype y) sequence 



<400> 65 
atggggacga 


atctttctgt 


tcccaatcct 


ctgggattct 


ttcccgatca 


ccagttggac 


60 


cctgcgttcg 


gagccaactc 


aaacaatcca 


gattggggcg 


gccgcatgaa 


ctctgattcc 


120 


gaatgcccgc 


tgtctcatga 


cggttactgc 


ctgcatgatg 


gcgtatgcat 


gtacatcgaa 


180 


gctctggaca 


aatacgcatg 


caactgtgtt 


gtaggttaca 


tcggcgaacg 


ttgccagtat 


240 


cgcgacctga 


aatggtggga 


actgcgtaag 


gcggccgccc 


ctgcaccgaa 


catggagaac 


300 


acaacatcag 


gattcctagg 


acccctgctc 


gtgttacagg 


cggggttttt 


cttgttgaca 


360 


3 /"I ^ "3 "t~ /~ "t" ^ 

ayaaLLCICd 


CddtdCCaCd 


gag xc ragac 


rcgrgg tgga 


c llctc ucaa 


tu tcraggg 


Aid 


ggagcaccca 


cgtgtcctgg 


ccaaaattcg 


cagtccccaa 


cctccaatca 


ctcaccaacc 


480 


tcttgtcctc 


caatttgtcc 


tggctatcgc 


tggatgtgtc 


tgcggcgttt 


tatcatattc 


540 


ctcttcatcc 


tgctgctatg 


cctcatcttc 


ttgttggttc 


ttctggacta 


ccaaggtatg 


600 


ttgcccgttt 


gtcctctact 


tccaggaaca 


tcaaccacca 


gcacggggcc 


atgcaagacc 


660 


tgcacgattc 


ctgctcgagg 


aacctctatg 


tttccctctt 


gttgctgtac 


aaaaccttcg 


720 


gacggaaact 


gcacttgtat 


tcccatccca 


tcatcctggg 


ctttcgcaag 


attcctatgg 


780 


gagtgggcct 


cagtccgttt 


ctcctggctc 


agtttactag 


tgccatttgt 


tcagtggttc 


840 


gtagggcttt 


cccccactgt 


ttggctttca 


gttatatgga 


tgatgtggta 


ttgggggcca 


900 


agtctgtaca 


acatcttgag 


tcccttttta 


cctctattac 


caattttctt 


ttgtctttgg 


960 


gtatacatt 












969 



<210> 66 

<211> 323 

<212> PRT 

<213> artificial sequence 
<220> 

<223> Proteins corresponding to 33-153 (Q129R) + EGF (serotype y) 
sequence 

<400> 66 

Met Gly Thr Asn Leu Ser Val Pro Asn Pro Leu Gly Phe Phe Pro Asp 
1 5 10 15 
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His Gin Leu Asp Pro Ala Phe Gly Ala Asn Ser Asn Asn Pro Asp Trp 
20 25 30 

Gly Gly Arg Met Asn ser Asp Ser Glu Cys Pro Leu Ser His Asp Gly 
35 40 45 

Tyr Cys Leu His Asp Gly val Cys Met Tyr lie Glu Ala Leu Asp Lys 
50 55 60 

Tyr Ala Cys Asn Cys Val Val Gly Tyr lie Gly Glu Arg Cys Gin Tyr 
65 ' 70 * * 75 80 

Arg Asp Leu Lys Trp Trp Glu Leu Arg Lys Ala Ala Ala Pro Ala Pro 
85 90 95 

Asn Met Glu Asn Thr Thr Ser Gly Phe Leu Gly Pro Leu Leu val Leu 
100 105 110 

Gin Ala Gly Phe Phe Leu Leu Thr Arg lie Leu Thr lie Pro Gin ser 
115 120 ~ 125 

Leu Asp Ser Trp Trp Thr Ser Leu Asn Phe Leu Gly Gly Ala Pro Thr 
130 135 140 

Cys Pro Gly Gin Asn Ser Gin Ser Pro Thr Ser Asn His Ser Pro Thr 
145 150 155 160 

Ser cys Pro Pro lie Cys Pro Gly Tyr Arg Trp Met cys Leu Arg Arg 
165 170 175 

Phe lie lie Phe Leu Phe lie Leu Leu Leu Cys Leu lie Phe Leu Leu 
180 185 190 

val Leu Leu Asp Tyr Gin Gly Met Leu Pro Val Cys Pro Leu Leu Pro 
195 200 205 

Gly Thr Ser Thr Thr Ser Thr Gly Pro Cys Lys Thr Cys Thr lie Pro 
210 215 220 

Ala Arg Gly Thr Ser Met Phe Pro Ser Cys Cys Cys Thr Lys Pro Ser 
225 230 235 * 240 

Asp Gly Asn Cys Thr Cys lie Pro lie Pro Ser ser Trp Ala Phe Ala 
245 250 255 

Arg Phe Leu Trp Glu Trp Ala Ser val Arg Phe Ser Trp Leu Ser Leu 
260 265 270 
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Leu val Pro Phe Val Gin Trp Phe Val Gly Leu Ser Pro Thr Val Trp 
275 280 285 

Leu Ser val lie Trp Met Met Trp Tyr Trp Gly Pro Ser Leu Tyr Asn 
290 295 300 

lie Leu Ser Pro Phe Leu Pro Leu Leu Pro lie Phe Phe Cys Leu Trp 
305 310 315 320 

val Tyr lie 



<210> 67 

<211> 969 

<212> DNA 

<213> artificial sequence 
<220> 

<223> 33-153 (G145R) + EGF (serotype y) sequence 

<400> 67 



atggggacga 


atctttctgt 


tcccaatcct 


ctgggattct 


ttcccgatca 


ccagttggac 


60 


cctgcgttcg 


gagccaactc 


aaacaatcca 


gattggggcg 


gccgcatgaa 


ctctgattcc 




gaatgcccgc 


tgtctcatga 


cggttactgc 


ctgcatgatg 


gcgtatgcat 


gtacatcgaa 


180 


gctctggaca 


aatacgcatg 


caactgtgtt 


gtaggttaca 


tcggcgaacg 


ttgccagtat 


240 


cgcgacctga 


aatggtggga 


actgcgtaag 


gcggccgccc 


ctgcaccgaa 


catggagaac 


300 


acaacatcag 


gattcctagg 


acccctgctc 


gtgttacagg 


cggggttttt 


cttgttgaca 


360 


agaatcctca 


caataccaca 


gagtctagac 


tcgtggtgga 


cttctctcaa 


ttttctaggg 


420 


ggagcaccca 


cgtgtcctgg 


ccaaaattcg 


cagtccccaa 


cctccaatca 


ctcaccaacc 


480 


tcttgtcctc 


caatttgtcc 


tggctatcgc 


tggatgtgtc 


tgcggcgttt 


tatcatattc 


540 


ctcttcatcc 


tgctgctatg 


cctcatcttc 


ttgttggttc 


ttctggacta 


ccaaggtatg 


600 


ttgcccgttt 


gtcctctact 


tccaggaaca 


tcaaccacca 


gcacggggcc 


atgcaagacc 


660 


tgcacgattc 


ctgctcaagg 


aacctctatg 


tttccctctt 


gttgctgtac 


aaaaccttcg 


720 


gacagaaact 


gcacttgtat 


tcccatccca 


tcatcctggg 


ctttcgcaag 


attcctatgg 


780 


gagtgggcct 


cagtccgttt 


ctcctggctc 


agtttactag 


tgccatttgt 


tcagtggttc 


840 


gtagggcttt 


cccccactgt 


ttggctttca 


gttatatgga 


tgatgtggta 


ttgggggcca 


900 


agtctgtaca 


acatcttgag 


tcccttttta 


cctctattac 


caattttctt 


ttgtctttgg 


960 


gtatacatt 












969 


<210> 68 
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<211> 323 
<212> PRT 

<213> arti f i ci al sequence 
<220> 

<223> Proteins corresponding to 33-153 (G145R) + EGF (serotype y) 
sequence 

<400> 68 

Met Gly Thr Asn Leu Ser Val Pro Asn Pro Leu Gly Phe Phe Pro Asp 
15 10 15 

His Gin Leu Asp Pro Ala Phe Gly Ala Asn Ser Asn Asn Pro Asp Trp 
20 25 30 

Gly Gly Arg Met Asn Ser Asp Ser Glu Cys Pro Leu Ser His Asp Gly 
35 40 45 

Tyr Cys Leu His Asp Gly val Cys Met Tyr lie Glu Ala Leu Asp Lys 
50 55 60 

Tyr Ala Cys Asn Cys Val val Gly Tyr lie Gly Glu Arg cys Gin Tyr 
65 70 75 ~ 80 

Arg Asp Leu Lys Trp Trp Glu Leu Arg Lys Ala Ala Ala Pro Ala Pro 
85 90 95 

Asn Met Glu Asn Thr Thr Ser Gly Phe Leu Gly Pro Leu Leu Val Leu 
100 105 110 

Gin Ala Gly Phe Phe Leu Leu Thr Arg lie Leu Thr lie Pro Gin Ser 
115 120 125 

Leu Asp Ser Trp Trp Thr Ser Leu Asn Phe Leu Gly Gly Ala Pro Thr 
130 135 140 

Cys Pro Gly Gin Asn ser Gin ser Pro Thr ser Asn His ser Pro Thr 
145 150 155 160 

Ser cys Pro Pro lie Cys Pro Gly Tyr Arg Trp Met Cys Leu Arg Arg 
165 170 175 

Phe lie lie Phe Leu Phe lie Leu Leu Leu Cys Leu lie Phe Leu Leu 
180 185 190 

Val Leu Leu Asp Tyr Gin Gly Met Leu Pro Val Cys Pro Leu Leu Pro 
195 200 205 

Gly Thr Ser Thr Thr Ser Thr Gly Pro Cys Lys Thr Cys Thr lie Pro 
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210 215 220 

Ala Gin Gly Thr Ser Met Phe Pro Ser Cys Cys Cys Thr Lys Pro Ser 
225 230 235 240 

Asp Arg Asn Cys Thr Cys lie Pro lie Pro Ser Ser Trp Ala Phe Ala 
245 250 255 

Arg Phe Leu Trp Glu Trp Ala Ser val Arg Phe Ser Trp Leu Ser Leu 
260 265 270 

Leu val Pro Phe val Gin Trp Phe Val Gly Leu Ser Pro Thr val Trp 
275 280 285 

Leu Ser val He Trp Met Met Trp Tyr Trp Gly Pro Ser Leu Tyr Asn 
290 295 300 

lie Leu Ser Pro Phe Leu Pro Leu Leu Pro lie Phe Phe Cys Leu Trp 
305 310 315 320 

val Tyr He 



<210> 69 
<211> 969 
<212> DNA 

<213> artificial sequence 










<220> 

<223> 33-153 (Q129R/G145R) + EGF (serotype 


y) sequence 




<400> 69 
atggggacga 


atctttctgt 


tcccaatcct 


ctgggattct 


ttcccgatca 


ccagttggac 


60 


cctgcgttcg 


gagccaactc 


aaacaatcca 


gattggggcg 


gccgcatgaa 


ctctgattcc 


120 


gaatgcccgc 


tgtctcatga 


cggttactgc 


ctgcatgatg 


gcgtatgcat 


gtacatcgaa 


180 


gctctggaca 


aatacgcatg 


caactgtgtt 


gtaggttaca 


tcggcgaacg 


ttgccagtat 


240 


cgcgacctga 


aatggtggga 


actgcgtaag 


gcggccgccc 


ctgcaccgaa 


catggagaac 


300 


acaacatcag 


gattcctagg 


acccctgctc 


gtgttacagg 


cggggttttt 


cttgttgaca 


360 


agaatcctca 


caataccaca 


gagtctagac 


tcgtggtgga 


cttctctcaa 


ttttctaggg 


420 


ggagcaccca 


cgtgtcctgg 


ccaaaattcg 


cagtccccaa 


cctccaatca 


ctcaccaacc 


480 


tcttgtcctc 


caatttgtcc 


tggctatcgc 


tggatgtgtc 


tgcggcgttt 


tatcatattc 


540 


ctcttcatcc 


tgctgctatg 


cctcatcttc 


ttgttggttc 


ttctggacta 


ccaaggtatg 


600 


ttgcccgttt 


gtcctctact 


tccaggaaca 


tcaaccacca 


gcacggggcc 


atgcaagacc 


660 


tgcacgattc 


ctgctcgagg 


aacctctatg 


tttccctctt 
Page 


gttgctgtac 
49 


aaaaccttcg 


720 
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gacagaaact gcacttgtat tcccatccca tcatcctggg ctttcgcaag attcctatgg 780 

gagtgggcct cagtccgttt ctcctggctc agtttactag tgccatttgt tcagtggttc 840 

gtagggcttt cccccactgt ttggctttca gttatatgga tgatgtggta ttgggggcca 900 

agtctgtaca acatcttgag tcccttttta cctctattac caattttctt ttgtctttgg 960 

gtatacatt 969 

<210> 70 

<211> 323 

<212> PRT 

<213> artificial sequence 
<220> 

<223> Proteins corresponding to 33-153 (Q129R/G145R) + EGF (serotype y) 
sequence 

<400> 70 

Met Gly Thr Asn Leu Ser val Pro Asn Pro Leu Gly Phe Phe Pro Asp 
1 5 10 15 

His Gin Leu Asp Pro Ala Phe Gly Ala Asn Ser Asn Asn Pro Asp Trp 
20 25 30 

Gly Gly Arg Met Asn Ser Asp Ser Glu Cys Pro Leu Ser His Asp Gly 
35 40 45 

Tyr Cys Leu His Asp Gly Val Cys Met Tyr lie Glu Ala Leu Asp Lys 
50 55 60 

Tyr Ala Cys Asn Cys Val Val Gly Tyr lie Gly Glu Arg Cys Gin Tyr 
65 70 75 80 

Arg Asp Leu Lys Trp Trp Glu Leu Arg Lys Ala Ala Ala Pro Ala Pro 
85 90 95 

Asn Met Glu Asn Thr Thr Ser Gly Phe Leu Gly Pro Leu Leu val Leu 
100 105 110 

Gin Ala Gly Phe Phe Leu Leu Thr Arg lie Leu Thr lie Pro Gin Ser 
115 120 125 

Leu Asp Ser Trp Trp Thr Ser Leu Asn Phe Leu Gly Gly Ala Pro Thr 
130 135 140 

Cys Pro Gly Gin Asn ser Gin Ser Pro Thr Ser Asn His Ser Pro Thr 
145 150 155 160 
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Ser cys Pro Pro lie Cys Pro Gly Tyr Arg Trp Met Cys Leu Arg Arg 
165 170 175 

Phe lie lie Phe Leu Phe lie Leu Leu Leu Cys Leu lie Phe Leu Leu 
180 185 190 

Val Leu Leu Asp Tyr Gin Gly Met Leu Pro Val Cys Pro Leu Leu Pro 
195 200 205 

Gly Thr Ser Thr Thr Ser Thr Gly Pro Cys Lys Thr Cys Thr lie Pro 
210 215 220 

Ala Arg Gly Thr ser Met Phe Pro Ser Cys Cys Cys Thr Lys Pro Ser 
225 230 235 240 

Asp Arg Asn Cys Thr Cys lie Pro lie Pro Ser ser Trp Ala Phe Ala 
245 250 255 

Arg Phe Leu Trp Glu Trp Ala Ser val Arg Phe Ser Trp Leu Ser Leu 
260 265 ~ 270 

Leu Val Pro Phe Val Gin Trp Phe val Gly Leu Ser Pro Thr val Trp 
275 280 285 

Leu Ser val lie Trp Met Met Trp Tyr Trp Gly Pro Ser Leu Tyr Asn 
290 295 300 

lie Leu Ser Pro Phe Leu Pro Leu Leu Pro lie Phe Phe Cys Leu Trp 
305 310 315 320 

val Tyr lie 



<210> 71 
<211> 828 
<212> DNA 

<213> artificial sequence 
<220> 

<223> 33-153 + STR-tag (serotype y) sequence 
<400> 71 

atggggacga atctttctgt tcccaatcct ctgggattct ttcccgatca ccagttggac 60 
cctgcgttcg gagccaactc aaacaatcca gattggggcg gccgctggag ccacccgcag 120 
ttcgaaaaag cggccgcccc tgcaccgaac atggagaaca caacatcagg attcctagga 180 
cccctgctcg tgttacaggc ggggtttttc ttgttgacaa gaatcctcac aataccacag 240 
agtctagact cgtggtggac ttctctcaat tttctagggg gagcacccac gtgtcctggc 300 
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caaaattcgc 


agtccccaac 


ctccaatcac 


tcaccaacct 


cttgtcctcc aatttgtcct 


360 


ggctatcgct 


ggatgtgtct 


gcggcgtttt 


atcatattcc 


tcttcatcct 


gctgctatgc 


420 


ctcatcttct 


tgttggttct 


tctggactac 


caaggtatgt 


tgcccgtttg 


tcctctactt 


480 


ccaggaacat 


caaccaccag 


cacggggcca 


tgcaagacct 


gcacgattcc 


tgctcaagga 


540 


acctctatgt 


ttccctcttg 


ttgctgtaca 


aaaccttcgg 


acggaaactg 


cacttgtatt 


600 


cccatcccat 


catcctgggc 


tttcgcaaga 


ttcctatggg 


agtgggcctc agtccgtttc 


660 


tcctggctca 


gtttactagt 


gccatttgtt 


cagtggttcg 


tagggctttc 


ccccactgtt 


720 


tggctttcag 


ttatatggat 


gatgtggtat 


tgggggccaa 


gtctgtacaa 


catcttgagt 


780 


ccctttttac 


ctctattacc 


aattttcttt 


tgtctttggg 


tatacatt 




828 



<210> 72 
<211> 276 
<212> PRT 

<213> artificial sequence 
<220> 

<223> Proteins corresponding to 33-153 + STR-tag (serotype y) sequence 
<400> 72 

Met Gly Thr Asn Leu Ser Val Pro Asn Pro Leu Gly Phe Phe Pro Asp 
15 10 15 

His Gin Leu Asp Pro Ala Phe Gly Ala Asn Ser Asn Asn Pro Asp Trp 
20 25 30 

Gly Gly Arg Trp Ser His Pro Gin Phe Glu Lys Ala Ala Ala Pro Ala 
35 40 45 

Pro Asn Met Glu Asn Thr Thr Ser Gly Phe Leu Gly Pro Leu Leu val 
50 55 60 

Leu Gin Ala Gly Phe Phe Leu Leu Thr Arg lie Leu Thr lie Pro Gin 
65 70 75 80 

Ser Leu Asp Ser Trp Trp Thr Ser Leu Asn Phe Leu Gly Gly Ala Pro 
85 90 95 

Thr Cys Pro Gly Gin Asn Ser Gin Ser Pro Thr Ser Asn His Ser Pro 
100 105 110 

Thr Ser Cys Pro Pro lie Cys Pro Gly Tyr Arg Trp Met Cys Leu Arg 
115 120 125 

Arg Phe He lie Phe Leu Phe lie Leu Leu Leu Cys Leu lie Phe Leu 
130 135 140 

Page 52 



» « 



sequence listing_ST25.txt 

Leu Val Leu Leu Asp Tyr Gin Gly Met Leu Pro val Cys Pro Leu Leu 

145 150 155 160 

Pro Gly Thr Ser Thr Thr Ser Thr Gly Pro Cys Lys Thr Cys Thr lie 

165 170 175 

pro Ala Gin Gly Thr Ser Met Phe Pro Ser Cys Cys Cys Thr Lys Pro 

180 185 190 

Ser Asp Gly Asn Cys Thr Cys lie Pro lie Pro Ser Ser Trp Ala Phe 

195 200 205 

Ala Arg Phe Leu Trp Glu Trp Ala Ser val Arg Phe Ser Trp Leu Ser 

210 215 220 



Leu Leu val Pro Phe Val Gin Trp Phe val Gly Leu Ser Pro Thr val 
225 230 235 240 

Trp Leu Ser val lie Trp Met Met Trp Tyr Trp Gly Pro ser Leu Tyr 
245 250 255 

Asn lie Leu Ser Pro Phe Leu Pro Leu Leu Pro lie Phe Phe Cys Leu 
260 265 270 

Trp val Tyr lie 
275 



<210> 73 
<211> 828 
<212> DNA 

<213> arti f i ci al sequence 










<220> 

<223> 33-153 (Q129R) 


+ STR-tag i 


(serotype y] 


i sequence 






<400> 73 
atggggacga 


atctttctgt 


tcccaatcct 


ctgggattct 


ttcccgatca 


ccagttggac 


60 


cctgcgttcg 


gagccaactc 


aaacaatcca 


gattggggcg 


gccgctggag 


ccacccgcag 


120 


ttcgaaaaag 


cggccgcccc 


tgcaccgaac 


atggagaaca 


caacatcagg 


attcctagga 


180 


cccctgctcg 


tgttacaggc 


ggggtttttc 


ttgttgacaa 


gaatcctcac 


aataccacag 


240 


agtctagact 


cgtggtggac 


ttctctcaat 


tttctagggg 


gagcacccac 


gtgtcctggc 


300 


caaaattcgc 


agtccccaac 


ctccaatcac 


tcaccaacct 


cttgtcctcc 


aatttgtcct 


360 


ggctatcgct 


ggatgtgtct 


gcggcgtttt 


atcatattcc 


tcttcatcct 


gctgctatgc 


420 


ctcatcttct 


tgttggttct 


tctggactac 


caaggtatgt 
Page 


tgcccgtttg 
53 


tcctctactt 


480 



sequence listing_ST25.txt 
ccaggaacat caaccaccag cacggggcca tgcaagacct gcacgattcc tgctcgagga 

acctctatgt ttccctcttg ttgctgtaca aaaccttcgg acggaaactg cacttgtatt 

cccatcccat catcctgggc tttcgcaaga ttcctatggg agtgggcctc agtccgtttc 

tcctggctca gtttactagt gccatttgtt cagtggttcg tagggctttc ccccactgtt 

tggctttcag ttatatggat gatgtggtat tgggggccaa gtctgtacaa catcttgagt 

ccctttttac ctctattacc aattttcttt tgtctttggg tatacatt 

<210> 74 

<211> 276 

<212> PRT 

<213> artificial sequence 
<220> 

<223> Proteins corresponding to 33-153 (Q129R) + STR-tag (serotype y) 
sequence 

<400> 74 

Met Gly Thr Asn Leu Ser val Pro Asn Pro Leu Gly Phe Phe Pro Asp 
15 10 15 

His Gin Leu Asp Pro Ala Phe Gly Ala Asn ser Asn Asn Pro Asp Trp 
20 25 30 

Gly Gly Arg Trp Ser His Pro Gin Phe Glu Lys Ala Ala Ala Pro Ala 
35 40 45 

Pro Asn Met Glu Asn Thr Thr Ser Gly Phe Leu Gly Pro Leu Leu Val 
50 55 60 

Leu Gin Ala Gly Phe Phe Leu Leu Thr Arg lie Leu Thr lie Pro Gin 
65 70 75 80 

Ser Leu Asp Ser Trp Trp Thr Ser Leu Asn Phe Leu Gly Gly Ala Pro 
85 90 95 

Thr Cys Pro Gly Gin Asn Ser Gin Ser Pro Thr Ser Asn His Ser Pro 
100 105 110 

Thr Ser Cys Pro Pro lie Cys Pro Gly Tyr Arg Trp Met Cys Leu Arg 
115 120 125 

Arg Phe lie lie Phe Leu Phe lie Leu Leu Leu Cys Leu lie Phe Leu 
130 135 140 

Leu val Leu Leu Asp Tyr Gin Gly Met Leu Pro val Cys Pro Leu Leu 
145 150 155 160 
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Pro Gly Thr ser Thr Thr ser Thr Gly Pro Cys Lys Thr cys Thr He 
165 170 175 

Pro Ala Arg Gly Thr Ser Met Phe Pro Ser Cys Cys Cys Thr Lys Pro 
180 185 190 

Ser Asp Gly Asn Cys Thr Cys lie Pro lie Pro Ser Ser Trp Ala Phe 
195 200 205 

Ala Arg Phe Leu Trp Glu Trp Ala Ser Val Arg Phe Ser Trp Leu Ser 
210 215 220 

Leu Leu val Pro Phe val Gin Trp Phe val Gly Leu ser Pro Thr val 
225 230 235 240 

Trp Leu Ser val He Trp Met Met Trp Tyr Trp Gly Pro ser Leu Tyr 
245 250 255 

Asn lie Leu Ser Pro Phe Leu Pro Leu Leu Pro lie Phe Phe Cys Leu 
260 265 270 

Trp val Tyr lie 
275 



<210> 75 

<211> 828 

<212> DNA 

<213> artificial sequence 
<220> 

<223> 33-153 (G145R) + STR-tag (serotype y) sequence 

<400> 75 



atggggacga 


atctttctgt 


tcccaatcct 


ctgggattct 


ttcccgatca 


ccagttggac 


60 


cctgcgttcg 


gagccaactc 


aaacaatcca 


gattggggcg 


gccgctggag 


ccacccgcag 


120 


ttcgaaaaag 


cggccgcccc 


tgcaccgaac 


atggagaaca 


caacatcagg 


attcctagga 


180 


cccctgctcg 


tgttacaggc 


ggggtttttc 


ttgttgacaa 


gaatcctcac 


aataccacag 


240 


agtctagact 


cgtggtggac 


ttctctcaat 


tttctagggg 


gagcacccac 


gtgtcctggc 


300 


caaaattcgc agtccccaac 


ctccaatcac 


tcaccaacct 


cttgtcctcc 


aatttgtcct 


360 


ggctatcgct 


ggatgtgtct 


gcggcgtttt 


atcatattcc 


tcttcatcct 


gctgctatgc 


420 


ctcatcttct 


tgttggttct 


tctggactac 


caaggtatgt 


tgcccgtttg 


tcctctactt 


480 


ccaggaacat 


caaccaccag 


cacggggcca 


tgcaagacct 


gcacgattcc 


tgctcaagga 


540 


acctctatgt 


ttccctcttg 


ttgctgtaca 


aaaccttcgg 


acagaaactg 


cacttgtatt 


600 


cccatcccat 


catcctgggc 


tttcgcaaga 


ttcctatggg 


agtgggcctc 


agtccgtttc 


660 
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tcctggctca gtttactagt gccatttgtt cagtggttcg tagggctttc ccccactgtt 720 

tggctttcag ttatatggat gatgtggtat tgggggccaa gtctgtacaa catcttgagt 780 

ccctttttac ctctattacc aattttcttt tgtctttggg tatacatt 828 

<210> 76 

<211> 276 

<212> PRT 

<213> artificial sequence 
<220> 

<223> Proteins corresponding to 33-153 (G145R) + STR-tag (serotype y) 
sequence 

<400> 76 

Met Gly Thr Asn Leu Ser Val Pro Asn Pro Leu Gly Phe Phe Pro Asp 
15 10 15 

His Gin Leu Asp Pro Ala Phe Gly Ala Asn Ser Asn Asn Pro Asp Trp 
20 25 30 

Gly Gly Arg Trp Ser His Pro Gin Phe Glu Lys Ala Ala Ala Pro Ala 
35 40 45 

Pro Asn Met Glu Asn Thr Thr Ser Gly Phe Leu Gly Pro Leu Leu val 
50 55 60 

Leu Gin Ala Gly Phe Phe Leu Leu Thr Arg lie Leu Thr lie Pro Gin 
65 70 75 80 

Ser Leu Asp Ser Trp Trp Thr Ser Leu Asn Phe Leu Gly Gly Ala Pro 
85 90 95 

Thr Cys Pro Gly Gin Asn ser Gin Ser Pro Thr ser Asn His Ser Pro 
100 105 110 

Thr Ser cys Pro Pro lie cys Pro Gly Tyr Arg Trp Met Cys Leu Arg 
115 120 125 

Arg Phe lie lie Phe Leu Phe lie Leu Leu Leu Cys Leu lie Phe Leu 
130 135 140 

Leu val Leu Leu Asp Tyr Gin Gly Met Leu Pro val Cys Pro Leu Leu 
145 150 155 160 

Pro Gly Thr ser Thr Thr Ser Thr Gly Pro Cys Lys Thr Cys Thr lie 
165 170 175 
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Pro Ala Gin Gly Thr Ser Met Phe Pro Ser Cys Cys Cys Thr Lys Pro 
180 185 190 



Ser Asp Arg Asn Cys Thr Cys lie Pro lie Pro Ser Ser Trp Ala Phe 
195 200 205 



Ala Arg Phe Leu Trp Glu Trp Ala Ser val Arg Phe Ser Trp Leu Ser 
210 215 220 



Leu Leu val Pro Phe val Gin Trp Phe val Gly Leu Ser Pro Thr val 
225 230 235 240 



Trp Leu Ser val lie Trp Met Met Trp Tyr Trp Gly Pro Ser Leu Tyr 
245 250 255 



Asn lie Leu Ser Pro Phe Leu Pro Leu Leu Pro lie Phe Phe Cys Leu 
260 265 270 



Trp val Tyr lie 
275 



<210> 77 

<211> 828 

<212> DNA 

<213> artificial sequence 
<220> 

<223> 33-153 (Q129R/G145R) + STR-tag (serotype y) sequence 

<400> 77 



atggggacga 


atctttctgt 


tcccaatcct 


ctgggattct 


ttcccgatca 


ccagttggac 


60 


cctgcgttcg 


gagccaactc 


aaacaatcca 


gattggggcg 


gccgctggag 


ccacccgcag 


120 


ttcgaaaaag 


cggccgcccc 


tgcaccgaac 


atggagaaca 


caacatcagg 


attcctagga 


180 


cccctgctcg 


tgttacaggc 


ggggtttttc 


ttgttgacaa 


gaatcctcac 


aataccacag 


240 


agtctagact 


cgtggtggac 


ttctctcaat 


tttctagggg 


gagcacccac 


gtgtcctggc 


300 


caaaattcgc 


agtccccaac 


ctccaatcac 


tcaccaacct 


cttgtcctcc 


aatttgtcct 


360 


ggctatcgct 


ggatgtgtct 


gcggcgtttt 


atcatattcc 


tcttcatcct 


gctgctatgc 


420 


ctcatcttct 


tgttggttct 


tctggactac 


caaggtatgt 


tgcccgtttg 


tcctctactt 


480 


ccaggaacat 


caaccaccag 


cacggggcca 


tgcaagacct 


gcacgattcc 


tgctcgagga 


540 


acctctatgt 


ttccctcttg 


ttgctgtaca 


aaaccttcgg 


acagaaactg 


cacttgtatt 


600 


cccatcccat 


catcctgggc 


tttcgcaaga 


ttcctatggg 


agtgggcctc 


agtccgtttc 


660 


tcctggctca 


gtttactagt 


gccatttgtt 


cagtggttcg 


tagggctttc 


ccccactgtt 


720 


tggctttcag 


ttatatggat 


gatgtggtat 


tgggggccaa 


gtctgtacaa 


catcttgagt 


780 
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ccctttttac ctctattacc aattttcttt tgtctttggg tatacatt 828 

<210> 78 

<211> 276 

<212> PRT 

<213> artificial sequence 
<220> 

<223> Proteins corresponding to 33-153 (Q129R/G145R) + STR-tag 
(serotype y) sequence 

<400> 78 

Met Gly Thr Asn Leu Ser val Pro Asn Pro Leu Gly Phe Phe Pro Asp 
15 10 15 

His Gin Leu Asp Pro Ala Phe Gly Ala Asn Ser Asn Asn Pro Asp Trp 
20 25 30 

Gly Gly Arg Trp Ser His Pro Gin Phe Glu Lys Ala Ala Ala Pro Ala 
35 40 45 

Pro Asn Met Glu Asn Thr Thr ser Gly Phe Leu Gly Pro Leu Leu Val 
50 55 60 

Leu Gin Ala Gly Phe Phe Leu Leu Thr Arg lie Leu Thr lie Pro Gin 
65 70 75 80 

Ser Leu Asp Ser Trp Trp Thr Ser Leu Asn Phe Leu Gly Gly Ala Pro 
85 90 95 

Thr cys Pro Gly Gin Asn Ser Gin Ser Pro Thr ser Asn His ser Pro 
100 105 110 

Thr Ser Cys Pro Pro He Cys Pro Gly Tyr Arg Trp Met Cys Leu Arg 
115 120 125 

Arg Phe lie lie Phe Leu Phe He Leu Leu Leu Cys Leu lie Phe Leu 
130 135 140 

Leu val Leu Leu Asp Tyr Gin Gly Met Leu Pro val Cys Pro Leu Leu 
145 150 155 160 

Pro Gly Thr Ser Thr Thr Ser Thr Gly Pro Cys Lys Thr Cys Thr lie 
165 170 175 

Pro Ala Arg Gly Thr ser Met Phe Pro Ser cys Cys Cys Thr Lys Pro 
180 185 190 

Ser Asp Arg Asn Cys Thr Cys lie Pro lie Pro Ser ser Trp Ala Phe 
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195 200 205 



Ala Arg Phe Leu Trp Glu Trp Ala Ser Val Arg Phe Ser Trp Leu Ser 
210 215 220 



Leu Leu Val Pro Phe Val Gin Trp Phe val Gly Leu Ser Pro Thr Val 
225 230 235 240 



Trp Leu Ser Val lie Trp Met Met Trp Tyr Trp Gly Pro Ser Leu Tyr 
245 250 255 



Asn lie Leu Ser Pro Phe Leu Pro Leu Leu Pro lie Phe Phe Cys Leu 
260 265 270 



Trp val Tyr lie 
275 



<210> 79 

<211> 1221 

<212> DNA 

<213> artificial sequence 



<220> 

<223> 50-153 + ZZ (serotype y) sequence 



<400> 79 
atggggacga 


atctttctgt 


tcccaatcct 


ctgggattct 


ttcccgatca 


ccagttggac 


60 


cctgcgttcg 


gagccaactc 


aaacaatcca 


gattgggact 


tcaaccccaa 


caaggatcaa 


120 


tggccagagg 


caaatcaggt 


aggagcgggc 


ggccgcgcgc 


aacacgatga 


agccgtagac 


180 


aacaaattca 


acaaagaaca 


acaaaacgcg 


ttctatgaga 


tcttacattt 


acctaactta 


240 


aacgaagaac 


aacgaaacgc 


cttcatccaa 


agtttaaaag 


atgacccaag 


ccaaagcgct 


300 


aaccttttag 


cagaagctaa 


aaagctaaat 


gatgctcagg 


cgccgaaagt 


agacaacaaa 


360 


ttcaacaaag 


aacaacaaaa 


cgcgttctat 


gagatcttac 


atttacctaa 


cttaaacgaa 


420 


gaacaacgaa 


acgccttcat 


ccaaagttta 


aaagatgacc 


caagccaaag 


cgctaacctt 


480 


ttagcagaag 


ctaaaaagct 


aaatgatgct 


caggcgccga 


aagcggccgc 


ccctgcaccg 


540 


aacatggaga 


acacaacatc 


aggattccta 


ggacccctgc 


tcgtgttaca 


ggcggggttt 


600 


ttcttgttga 


caagaatcct 


cacaatacca 


cagagtctag 


actcgtggtg 


gacttctctc 


660 


aattttctag 


ggggagcacc 


cacgtgtcct 


ggccaaaatt 


cgcagtcccc 


aacctccaat 


720 


cactcaccaa 


cctcttgtcc 


tccaatttgt 


cctggctatc 


gctggatgtg 


tctgcggcgt 


780 


tttatcatat 


tcctcttcat 


cctgctgcta 


tgcctcatct 


tcttgttggt 


tcttctggac 


840 


taccaaggta 


tgttgcccgt 


ttgtcctcta 


cttccaggaa 


catcaaccac 


cagcacgggg 


900 


ccatgcaaga 


cctgcacgat 


tcctgctcaa 


ggaacctcta 
Page 


tgtttccctc 
59 


ttgttgctgt 


960 



sequence listing_ST25.txt 

acaaaacctt cggacggaaa ctgcacttgt attcccatcc catcatcctg ggctttcgca 1020 

agattcctat gggagtgggc ctcagtccgt ttctcctggc tcagtttact agtgccattt 1080 

gttcagtggt tcgtagggct ttcccccact gtttggcttt cagttatatg gatgatgtgg 1140 

tattgggggc caagtctgta caacatcttg agtccctttt tacctctatt accaattttc 1200 

ttttgtcttt gggtatacat t 1221 

<210> 80 

<211> 407 

<212> PRT 

<213> artificial sequence 
<220> 

<223> Proteins corresponding to 50-153 + ZZ (serotype y) sequence 

<400> 80 

Met Gly Thr Asn Leu Ser val Pro Asn Pro Leu Gly Phe Phe Pro Asp 
1 5 10 15 

His Gin Leu Asp Pro Ala Phe Gly Ala Asn Ser Asn Asn Pro Asp Trp 
20 25 30 

Asp Phe Asn Pro Asn Lys Asp Gin Trp Pro Glu Ala Asn Gin val Gly 
35 40 45 

Ala Gly Gly Arg Ala Gin His Asp Glu Ala val Asp Asn Lys Phe Asn 
50 ~ 55 60 

Lys Glu Gin Gin Asn Ala Phe Tyr Glu He Leu His Leu Pro Asn Leu 
65 70 75 80 

Asn Glu Glu Gin Arg Asn Ala Phe lie Gin Ser Leu Lys Asp Asp Pro 
85 90 95 

Ser Gin ser Ala Asn Leu Leu Ala Glu Ala Lys Lys Leu Asn Asp Ala 
100 105 110 

Gin Ala Pro Lys val Asp Asn Lys Phe Asn Lys Glu Gin Gin Asn Ala 
115 120 125 

Phe Tyr Glu lie Leu His Leu Pro Asn Leu Asn Glu Glu Gin Arg Asn 
130 135 140 

Ala Phe lie Gin Ser Leu Lys Asp Asp Pro Ser Gin Ser Ala Asn Leu 
145 150 155 160 

Leu Ala Glu Ala Lys Lys Leu Asn Asp Ala Gin Ala Pro Lys Ala Ala 
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165 170 175 

Ala Pro Ala Pro Asn Met Glu Asn Thr Thr Ser Gly Phe Leu Gly Pro 
180 185 ' 190 

Leu Leu Val Leu Gin Ala Gly Phe Phe Leu Leu Thr Arg lie Leu Thr 
195 200 205 

lie Pro Gin Ser Leu Asp Ser Trp Trp Thr Ser Leu Asn Phe Leu Gly 
210 215 220 

Gly Ala Pro Thr Cys Pro Gly Gin Asn Ser Gin Ser Pro Thr Ser Asn 
225 230 235 240 

His Ser Pro Thr Ser Cys Pro Pro lie Cys Pro Gly Tyr Arg Trp Met 
245 250 255 

Cys Leu Arg Arg Phe lie lie Phe Leu Phe lie Leu Leu Leu Cys Leu 
260 265 270 

lie Phe Leu Leu val Leu Leu Asp Tyr Gin Gly Met Leu Pro Val Cys 
275 280 285 

Pro Leu Leu Pro Gly Thr Ser Thr Thr ser Thr Gly Pro Cys Lys Thr 
290 295 300 

Cys Thr lie Pro Ala Gin Gly Thr ser Met Phe Pro Ser Cys Cys Cys 
305 310 315 320 

Thr Lys Pro Ser Asp Gly Asn Cys Thr Cys lie Pro lie Pro Ser Ser 
325 330 335 

Trp Ala Phe Ala Arg Phe Leu Trp Glu Trp Ala Ser val Arg Phe Ser 
340 ~ 345 350 

Trp Leu Ser Leu Leu val Pro Phe Val Gin Trp Phe val Gly Leu Ser 
355 360 365 

Pro Thr val Trp Leu Ser val lie Trp Met Met Trp Tyr Trp Gly Pro 
370 375 380 

Ser Leu Tyr Asn lie Leu Ser Pro Phe Leu Pro Leu Leu Pro lie Phe 
385 390 395 400 

Phe Cys Leu Trp val Tyr lie 
405 
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<210> 81 

<211> 1221 

<212> DNA 

<213> artificial sequence 
<220> 

<223> 50-153 (Q129R) + ZZ (serotype y) sequence 

<400> 81 



atggggacga 


atctttctgt 


tcccaatcct 


ctgggattct 


ttcccgatca 


ccagttggac 


60 


cctgcgttcg 


gagccaactc 


aaacaatcca 


gattgggact 


tcaaccccaa 


caaggatcaa 


120 


tggccagagg 


caaatcaggt 


aggageggge 


ggccgcgcgc 


aacacgatga 


ageegtagae 


180 


aacaaattca 


acaaagaaca 


acaaaacgeg 


ttctatgaga 


tcttacattt 


acctaactta 


240 


aacgaagaac 


aacgaaacgc 


cttcatccaa 


agtttaaaag 


atgacccaag 


ccaaagcgct 


300 


aaccttttag 


cagaagctaa 


aaagctaaat 


gatgetcagg 


cgccgaaagt 


agacaacaaa 


360 


ttcaacaaag 


aacaacaaaa 


cgcgttctat 


gagatcttac 


atttacctaa 


cttaaacgaa 


420 


gaacaacgaa 


acgccttcat 


ccaaagttta 


aaagatgacc 


caagecaaag 


cgctaacctt 


480 


ttagcagaag 


ctaaaaagct 


aaatgatget 


caggcgccga 


aagcggccgc 


ccctgcaccg 


540 


aacatggaga 


acacaacatc 


aggattccta 


ggacccctgc 


tcgtgttaca 


ggcggggttt 


600 


ttcttgttga 


caagaatcct 


cacaatacca 


cagagtctag 


actcgtggtg 


gacttctctc 


660 


aattttctag 


ggggagcacc 


cacgtgtcct 


ggecaaaatt 


cgcagtcccc 


aacctccaat 


720 


LdLLCdLLdd 


cccc Lcgt.cc 


trraatttnt 


CLUyyL.l_d.LL. 


yLLyydLyLy 




/ Ou 


tttatcatat 


tcctcttcat 


cctgctgcta 


tgcctcatct 


tcttgttggt 


tcttctggac 


840 


taccaaggta 


tgttgcccgt 


ttgtcctcta 


cttccaggaa 


catcaaccac 


cagcaegggg 


900 


ccatgcaaga 


cctgcacgat 


tcctgctcga 


ggaacctcta 


tgtttccctc 


ttgttgctgt 


960 


acaaaacctt 


eggaeggaaa 


ctgcacttgt 


attcccatcc 


catcatcctg 


ggctttcgea 


1020 


agattcctat 


gggagtgggc 


ctcagtccgt 


ttctcctggc 


tcagtttact 


agtgccattt 


1080 


gttcagtggt 


tegtaggget 


ttcccccact 


gtttggcttt 


cagttatatg 


gatgatgtgg 


1140 


tattgggggc 


caagtctgta 


caacatcttg 


agtccctttt 


tacctctatt 


accaattttc 


1200 


ttttgtcttt 


gggtatacat 


t 








1221 



<210> 82 

<211> 407 

<212> PRT 

<213> artificial sequence 
<220> 

<223> Proteins corresponding to 50-153 (Q129R) + ZZ (serotype y) 
sequence 

<400> 82 
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Met Gly Thr Asn Leu Ser Val Pro Asn Pro Leu Gly Phe Phe Pro Asp 
15 10 15 

His Gin Leu Asp Pro Ala Phe Gly Ala Asn Ser Asn Asn Pro Asp Trp 
20 25 30 

Asp Phe Asn Pro Asn Lys Asp Gin Trp Pro Glu Ala Asn Gin val Gly 
35 40 45 

Ala Gly Gly Arg Ala Gin His Asp Glu Ala val Asp Asn Lys Phe Asn 
50 ~ 55 60 

Lys Glu Gin Gin Asn Ala Phe Tyr Glu lie Leu His Leu Pro Asn Leu 
65 70 75 80 

Asn Glu Glu Gin Arg Asn Ala Phe lie Gin Ser Leu Lys Asp Asp Pro 
85 90 95 

Ser Gin ser Ala Asn Leu Leu Ala Glu Ala Lys Lys Leu Asn Asp Ala 
100 105 110 

Gin Ala Pro Lys val Asp Asn Lys Phe Asn Lys Glu Gin Gin Asn Ala 
115 120 125 

Phe Tyr Glu lie Leu His Leu Pro Asn Leu Asn Glu Glu Gin Arg Asn 
130 135 140 

Ala Phe lie Gin Ser Leu Lys Asp Asp Pro Ser Gin Ser Ala Asn Leu 
145 150 155 160 

Leu Ala Glu Ala Lys Lys Leu Asn Asp Ala Gin Ala Pro Lys Ala Ala 
165 170 175 

Ala Pro Ala Pro Asn Met Glu Asn Thr Thr Ser Gly Phe Leu Gly Pro 
180 185 190 

Leu Leu val Leu Gin Ala Gly Phe Phe Leu Leu Thr Arg lie Leu Thr 
195 200 205 

lie Pro Gin Ser Leu Asp Ser Trp Trp Thr Ser Leu Asn Phe Leu Gly 
210 215 220 

Gly Ala Pro Thr Cys Pro Gly Gin Asn Ser Gin ser Pro Thr Ser Asn 
225 230 235 240 

His Ser Pro Thr ser Cys Pro Pro lie Cys Pro Gly Tyr Arg Trp Met 
245 250 255 
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Cys Leu Arg Arg Phe lie lie Phe Leu Phe lie Leu Leu Leu Cys Leu 
260 265 270 

lie Phe Leu Leu val Leu Leu Asp Tyr Gin Gly Met Leu Pro val Cys 
275 280 285 

Pro Leu Leu Pro Gly Thr Ser Thr Thr Ser Thr Gly Pro Cys Lys Thr 
290 295 300 

Cys Thr lie Pro Ala Arg Gly Thr Ser Met Phe Pro Ser Cys Cys Cys 
305 310 * 315 320 

Thr Lys Pro Ser Asp Gly Asn Cys Thr Cys lie Pro lie Pro Ser ser 
325 ' 330 335 

Trp Ala Phe Ala Arg Phe Leu Trp Glu Trp Ala Ser Val Arg Phe Ser 
340 345 350 

Trp Leu Ser Leu Leu val Pro Phe val Gin Trp Phe val Gly Leu Ser 
355 360 365 

Pro Thr Val Trp Leu Ser val lie Trp Met Met Trp Tyr Trp Gly Pro 
370 375 380 

Ser Leu Tyr Asn lie Leu Ser Pro Phe Leu Pro Leu Leu Pro lie Phe 
385 * 390 395 400 

Phe Cys Leu Trp val Tyr lie 
405 



<210> 
<211> 
<212> 
<213> 


83 

1221 
DNA 

arti f i ci al sequence 










<220> 
<223> 


50-153 (G145R) 


+ ZZ (serotype y) sequence 






<400> 83 

atggggacga atctttctgt 


tcccaatcct 


ctgggattct 


ttcccgatca 


ccagttggac 


60 


cctgcgttcg gagccaactc 


aaacaatcca 


gattgggact 


tcaaccccaa 


caaggatcaa 


120 


tggccagagg caaatcaggt 


aggagcgggc 


ggccgcgcgc 


aacacgatga 


agccgtagac 


180 


aacaaattca acaaagaaca 


acaaaacgcg 


ttctatgaga 


tcttacattt 


acctaactta 


240 


aacgaagaac aacgaaacgc 


cttcatccaa 


agtttaaaag 


atgacccaag 


ccaaagcgct 


300 


aaccttttag cagaagctaa 


aaagctaaat 


gatgctcagg 


cgccgaaagt 


agacaacaaa 


360 


ttcaacaaag aacaacaaaa 


cgcgttctat 


gagatcttac 


atttacctaa 


cttaaacgaa 


420 
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gaacaacgaa 


acgccttcat 


ccaaagttta 


aaagatgacc 


caagccaaag 


cgctaacctt 


480 


ttagcagaag 


ctaaaaagct 


aaatgatgct 


caggcgccga 


aagcggccgc 


ccctgcaccg 


540 


aacatggaga 


acacaacatc 


aggattccta 


ggacccctgc 


tcgtgttaca 


ggcggggttt 


600 


ttcttgttga 


caagaatcct 


cacaatacca 


cagagtctag 


actcgtggtg 


gacttctctc 


660 


aattttctag 


ggggagcacc 


cacgtgtcct 


ggccaaaatt 


cgcagtcccc 


aacctccaat 


720 


cactcaccaa 


cctcttgtcc 


tccaatttgt 


cctggctatc 


gctggatgtg 


tctgcggcgt 


780 


tttatcatat 


tcctcttcat 


cctgctgcta 


tgcctcatct 


tcttgttggt 


tcttctggac 


840 


taccaaggta 


tgttgcccgt 


ttgtcctcta 


cttccaggaa 


catcaaccac 


cagcacgggg 


900 


ccatgcaaga 


cctgcacgat 


tcctgctcaa 


ggaacctcta 


tgtttccctc 


ttgttgctgt 


960 


acaaaacctt 


cggacagaaa 


ctgcacttgt 


attcccatcc 


catcatcctg 


ggctttcgca 


1020 


aaattcctat 


aaaaataaac 
yyy«y *-yyy v - 


ctcagtccgt 


ttctcctggc 


tcagtttact 


aataccattt 


1080 


gttcagtggt 


tcgtagggct 


ttcccccact 


gtttggcttt 


cagttatatg 


gatgatgtgg 


1140 


tattgggggc 


caagtctgta 


caacatcttg 


agtccctttt 


tacctctatt 


accaattttc 


1200 


ttttgtcttt 


gggtatacat 


t 








1221 



<210> 84 

<211> 407 

<212> PRT 

<213> artificial sequence 
<220> 

<223> Proteins corresponding to 50-153 (G145R) + ZZ (serotype y) 
sequence 

<400> 84 

Met Gly Thr Asn Leu Ser val Pro Asn Pro Leu Gly Phe Phe Pro Asp 
15 10 15 

His Gin Leu Asp Pro Ala Phe Gly Ala Asn Ser Asn Asn Pro Asp Trp 
20 25 30 

Asp Phe Asn Pro Asn Lys Asp Gin Trp Pro Glu Ala Asn Gin Val Gly 
35 40 45 

Ala Gly Gly Arg Ala Gin His Asp Glu Ala val Asp Asn Lys Phe Asn 
50 ~ 55 60 

Lys Glu Gin Gin Asn Ala Phe Tyr Glu lie Leu His Leu Pro Asn Leu 
65 70 75 80 

Asn Glu Glu Gin Arg Asn Ala Phe lie Gin Ser Leu Lys Asp Asp Pro 
85 90 95 
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Ser Gin Ser Ala Asn Leu Leu Ala Glu Ala Lys Lys Leu Asn Asp Ala 
100 105 110 

Gin Ala Pro Lys Val Asp Asn Lys Phe Asn Lys Glu Gin Gin Asn Ala 
115 120 125 

Phe Tyr Glu lie Leu His Leu Pro Asn Leu Asn Glu Glu Gin Arg Asn 
130 135 140 

Ala Phe lie Gin Ser Leu Lys Asp Asp Pro Ser Gin Ser Ala Asn Leu 
145 150 155 160 

Leu Ala Glu Ala Lys Lys Leu Asn Asp Ala Gin Ala Pro Lys Ala Ala 
165 170 175 

Ala Pro Ala Pro Asn Met Glu Asn Thr Thr Ser Gly Phe Leu Gly Pro 
180 185 190 

Leu Leu Val Leu Gin Ala Gly Phe Phe Leu Leu Thr Arg lie Leu Thr 
195 200 205 

lie Pro Gin Ser Leu Asp Ser Trp Trp Thr Ser Leu Asn Phe Leu Gly 
210 215 220 

Gly Ala Pro Thr Cys Pro Gly Gin Asn Ser Gin Ser Pro Thr Ser Asn 
225 230 235 240 

His Ser Pro Thr Ser Cys Pro Pro lie Cys Pro Gly Tyr Arg Trp Met 
245 250 255 

Cys Leu Arg Arg Phe lie lie Phe Leu Phe lie Leu Leu Leu Cys Leu 
260 265 270 

lie Phe Leu Leu val Leu Leu Asp Tyr Gin Gly Met Leu Pro val Cys 
275 280 285 

Pro Leu Leu Pro Gly Thr Ser Thr Thr Ser Thr Gly Pro Cys Lys Thr 
290 295 300 

cys Thr lie Pro Ala Gin Gly Thr Ser Met Phe Pro Ser Cys cys Cys 
305 310 315 * 320 

Thr Lys Pro Ser Asp Arg Asn Cys Thr Cys lie Pro lie Pro Ser ser 
325 ' 330 335 

Trp Ala Phe Ala Arg Phe Leu Trp Glu Trp Ala Ser Val Arg Phe Ser 
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340 345 350 



Trp Leu Ser Leu Leu Val Pro Phe Val Gin Trp Phe Val Gly Leu Ser 
355 360 365 



Pro Thr Val Trp Leu Ser val lie Trp Met Met Trp Tyr Trp Gly Pro 
370 375 380 



Ser Leu Tyr Asn lie Leu Ser Pro Phe Leu Pro Leu Leu Pro lie Phe 
385 390 395 400 



Phe Cys Leu Trp val Tyr lie 
405 



<210> 85 

<211> 1221 

<212> DNA 

<213> artificial sequence 
<220> 

<223> 50-153 (Q129R/G145R) + ZZ (serotype y) sequence 



<400> 85 



a uggggacga 


a lclxxc rg l 




c Lgggax lc l 


l Lcccgarca 


ccag l tggac 


DU 


cctgcgttcg 


gagccaactc 


aaacaatcca 


gattgggact 


tcaaccccaa 


caaggatcaa 


120 


tggccagagg 


caaatcaggt 


aggagcgggc 


ggccgcgcgc 


aacacgatga 


agccgtagac 


180 


aacaaattca 


acaaagaaca 


acaaaacgcg 


ttctatgaga 


tcttacattt 


acctaactta 


240 


aacgaagaac 


aacgaaacgc 


cttcatccaa 


agtttaaaag 


atgacccaag 


ccaaagcgct 


300 


aaccttttag 


cagaagctaa 


aaagctaaat 


gatgctcagg 


cgccgaaagt 


agacaacaaa 


360 


ttcaacaaag 


aacaacaaaa 


cgcgttctat 


gagatcttac 


atttacctaa 


cttaaacgaa 


420 


gaacaacgaa 


acgccttcat 


ccaaagttta 


aaagatgacc 


caagccaaag 


cgctaacctt 


480 


ttagcagaag 


ctaaaaagct 


aaatgatgct 


caggcgccga 


aagcggccgc 


ccctgcaccg 


540 


aacatggaga 


acacaacatc 


aggattccta 


ggacccctgc 


tcgtgttaca 


ggcggggttt 


600 


ttcttgttga 


caagaatcct 


cacaatacca 


cagagtctag 


actcgtggtg 


gacttctctc 


660 


aattttctag 


ggggagcacc 


cacgtgtcct 


ggccaaaatt 


cgcagtcccc 


aacctccaat 


720 


cactcaccaa 


cctcttgtcc 


tccaatttgt 


cctggctatc 


gctggatgtg 


tctgcggcgt 


780 


tttatcatat 


tcctcttcat 


cctgctgcta 


tgcctcatct 


tcttgttggt 


tcttctggac 


840 


taccaaggta 


tgttgcccgt 


ttgtcctcta 


cttccaggaa 


catcaaccac 


cagcacgggg 


900 


ccatgcaaga 


cctgcacgat 


tcctgctcga 


ggaacctcta 


tgtttccctc 


ttgttgctgt 


960 


acaaaacctt 


cggacagaaa 


ctgcacttgt 


attcccatcc 


catcatcctg 


ggctttcgca 


1020 


agattcctat 


gggagtgggc 


ctcagtccgt 


ttctcctggc 
Page 


tcagtttact 
67 


agtgccattt 


1080 



sequence listing_ST25.txt 

gttcagtggt tcgtagggct ttcccccact gtttggcttt cagttatatg gatgatgtgg 1140 

tattgggggc caagtctgta caacatcttg agtccctttt tacctctatt accaattttc 1200 

ttttgtcttt gggtatacat t 1221 

<210> 86 

<211> 407 

<212> PRT 

<213> artificial sequence 
<220> 

<223> Proteins corresponding to 50-153 (Q129R/G145R) + zz (serotype y) 
sequence 

<400> 86 

Met Gly Thr Asn Leu Ser Val Pro Asn Pro Leu Gly Phe Phe Pro Asp 
15 10 15 

His Gin Leu Asp Pro Ala Phe Gly Ala Asn Ser Asn Asn Pro Asp Trp 
20 25 30 

Asp Phe Asn Pro Asn Lys Asp Gin Trp Pro Glu Ala Asn Gin val Gly 
35 40 45 

Ala Gly Gly Arg Ala Gin His Asp Glu Ala val Asp Asn Lys Phe Asn 
50 55 60 

Lys Glu Gin Gin Asn Ala Phe Tyr Glu lie Leu His Leu Pro Asn Leu 
65 70 75 80 

Asn Glu Glu Gin Arg Asn Ala Phe lie Gin Ser Leu Lys Asp Asp Pro 
85 90 95 

Ser Gin ser Ala Asn Leu Leu Ala Glu Ala Lys Lys Leu Asn Asp Ala 
100 105 110 

Gin Ala Pro Lys val Asp Asn Lys Phe Asn Lys Glu Gin Gin Asn Ala 
115 120 125 

Phe Tyr Glu lie Leu His Leu Pro Asn Leu Asn Glu Glu Gin Arg Asn 
130 135 140 

Ala Phe lie Gin Ser Leu Lys Asp Asp Pro Ser Gin ser Ala Asn Leu 
145 150 155 160 

Leu Ala Glu Ala Lys Lys Leu Asn Asp Ala Gin Ala Pro Lys Ala Ala 
165 170 175 
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Ala Pro Ala Pro Asn Met Glu Asn Thr Thr Ser Gly Phe Leu Gly Pro 
180 185 190 

Leu Leu val Leu Gin Ala Gly Phe Phe Leu Leu Thr Arg lie Leu Thr 
195 200 205 

lie Pro Gin Ser Leu Asp Ser Trp Trp Thr Ser Leu Asn Phe Leu Gly 
210 215 220 

Gly Ala Pro Thr Cys Pro Gly Gin Asn Ser Gin Ser Pro Thr Ser Asn 
225 230 235 240 

His Ser Pro Thr ser Cys Pro Pro lie Cys Pro Gly Tyr Arg Trp Met 
245 250 * 255 

Cys Leu Arg Arg Phe lie lie Phe Leu Phe lie Leu Leu Leu Cys Leu 
260 265 270 

lie Phe Leu Leu val Leu Leu Asp Tyr Gin Gly Met Leu Pro val Cys 
275 280 285 

Pro Leu Leu Pro Gly Thr Ser Thr Thr Ser Thr Gly Pro Cys Lys Thr 
290 295 300 

Cys Thr lie Pro Ala Arg Gly Thr Ser Met Phe Pro Ser Cys Cys Cys 
305 310 315 320 

Thr Lys Pro Ser Asp Arg Asn Cys Thr Cys lie Pro lie Pro Ser Ser 
325 ~ 330 335 

Trp Ala Phe Ala Arg Phe Leu Trp Glu Trp Ala Ser Val Arg Phe Ser 
340 345 350 

Trp Leu Ser Leu Leu val Pro Phe Val Gin Trp Phe val Gly Leu Ser 
355 360 365 

Pro Thr Val Trp Leu Ser Val lie Trp Met Met Trp Tyr Trp Gly Pro 
370 375 380 

Ser Leu Tyr Asn lie Leu Ser Pro Phe Leu Pro Leu Leu Pro lie Phe 
385 390 395 400 

Phe Cys Leu Trp val Tyr lie 
405 



<210> 87 
<211> 1020 
<212> DNA 
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<213> artificial sequence 
<220> 

<223> 50-153 + EGF (serotype y) sequence 



<400> 87 
atggggacga 


atctttctgt 


tcccaatcct 


ctgggattct 


ttcccgatca 


ccagttggac 


60 


cctgcgttcg 


gagccaactc 


aaacaatcca 


gattgggact 


tcaaccccaa 


caaggatcaa 


120 


tggccagagg 


caaatcaggt 


aggagcgggc 


ggccgcatga 


actctgattc 


cgaatgcccg 


180 


ctgtctcatg 


acggttactg 


cctgcatgat 


ggcgtatgca tgtacatcga 


agctctggac 


240 


aaatacgcat 


gcaactgtgt 


tgtaggttac 


atcggcgaac gttgccagta 


tcgcgacctg 


300 


aaatggtggg 


aactgcgtaa 


ggcggccgcc 


cctgcaccga acatggagaa 


cacaacatca 


360 


ggattcctag 


gacccctgct 


cgtgttacag 


gcggggtttt 


tcttgttgac 


aagaatcctc 


420 


acaataccac 


agagtctaga 


ctcgtggtgg 


acttctctca 


attttctagg 


gggagcaccc 


480 


acgtgtcctg 


gccaaaattc 


gcagtcccca 


acctccaatc 


actcaccaac 


ctcttgtcct 


540 


ccaatttgtc 


ctggctatcg 


ctggatgtgt 


ctgcggcgtt 


ttatcatatt 


cctcttcatc 


600 








cttctggact 


accaaggtat 


a 1" "t a c c c a t* t* 


660 


tgtcctctac 


ttccaggaac 


atcaaccacc 


agcacggggc 


catgcaagac 


ctgcacgatt 


720 


cctgctcaag 


gaacctctat 


gtttccctct 


tgttgctgta 


caaaaccttc 


ggacggaaac 


780 


tgcacttgta 


ttcccatccc 


atcatcctgg 


gctttcgcaa 


gattcctatg 


ggagtgggcc 


840 


tcagtccgtt 


tctcctggct 


cagtttacta 


gtgccatttg 


ttcagtggtt 


cgtagggctt 


900 


tcccccactg 


tttggctttc 


agttatatgg 


atgatgtggt 


attgggggcc 


aagtctgtac 


960 


aacatcttga 


gtcccttttt 


acctctatta 


ccaattttct 


tttgtctttg 


ggtatacatt 


1020 



<210> 88 

<211> 340 

<212> PRT 

<213> artificial sequence 
<220> 

<223> Proteins corresponding to 50-153 + EGF (serotype y) sequence 

<400> 88 

Met Gly Thr Asn Leu Ser val Pro Asn Pro Leu Gly Phe Phe Pro Asp 
1 5 10 15 

His Gin Leu Asp Pro Ala Phe Gly Ala Asn Ser Asn Asn Pro Asp Trp 
20 25 30 

Asp Phe Asn Pro Asn Lys Asp Gin Trp Pro Glu Ala Asn Gin val Gly 
35 40 45 
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Ala Gly Gly Arg Met Asn Ser Asp Ser Glu Cys Pro Leu Ser His Asp 
50 55 60 

Gly Tyr Cys Leu His Asp Gly val Cys Met Tyr lie Glu Ala Leu Asp 
65 70 75 80 

Lys Tyr Ala Cys Asn Cys val val Gly Tyr lie Gly Glu Arg Cys Gin 
85 90 95 

Tyr Arg Asp Leu Lys Trp Trp Glu Leu Arg Lys Ala Ala Ala Pro Ala 
100 105 " 110 

Pro Asn Met Glu Asn Thr Thr Ser Gly Phe Leu Gly Pro Leu Leu val 
115 120 125 

Leu Gin Ala Gly Phe Phe Leu Leu Thr Arg lie Leu Thr lie Pro Gin 
130 135 140 

Ser Leu Asp Ser Trp Trp Thr ser Leu Asn Phe Leu Gly Gly Ala Pro 
145 150 155 160 

Thr Cys Pro Gly Gin Asn Ser Gin Ser Pro Thr ser Asn His ser Pro 
165 170 175 

Thr Ser Cys Pro Pro lie Cys Pro Gly Tyr Arg Trp Met Cys Leu Arg 
180 185 190 

Arg Phe lie lie Phe Leu Phe lie Leu Leu Leu Cys Leu lie Phe Leu 
195 200 205 

Leu val Leu Leu Asp Tyr Gin Gly Met Leu Pro val Cys Pro Leu Leu 
210 215 220 

Pro Gly Thr Ser Thr Thr Ser Thr Gly Pro Cys Lys Thr Cys Thr lie 
225 230 235 240 

Pro Ala Gin Gly Thr Ser Met Phe Pro Ser Cys Cys Cys Thr Lys Pro 
245 250 255 

Ser Asp Gly Asn Cys Thr Cys lie Pro lie Pro Ser Ser Trp Ala Phe 
260 265 270 

Ala Arg Phe Leu Trp Glu Trp Ala Ser Val Arg Phe Ser Trp Leu Ser 
275 280 285 

Leu Leu val Pro Phe Val Gin Trp Phe val Gly Leu Ser Pro Thr val 
290 295 300 
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Trp Leu Ser val lie Trp Met Met Trp Tyr Trp Gly Pro Ser Leu Tyr 
305 310 315 320 

Asn lie Leu ser Pro Phe Leu Pro Leu Leu Pro lie Phe Phe Cys Leu 
325 330 335 

Trp val Tyr lie 
340 

<210> 89 
<211> 1020 
<212> DNA 

<213> artificial sequence 
<220> 

<223> 50-153 (Q129R) + EGF (serotype y) sequence 



<400> 89 
atggggacga 


atctttctgt 


tcccaatcct 


ctgggattct 


ttcccgatca 


ccagttggac 


60 


cctgcgttcg 


gagccaactc 


aaacaatcca 


gattgggact 


tcaaccccaa 


caaggatcaa 


120 


Lyyu-dydyy 


Laaa Li_ayy l 


ayyd.yt.yyyL 


yyuLyLatyd 


artrtfiattr 
dLLLLydLLL 


Ly dd Ly LLLy 


XOU 


ctgtctcatg 


acggttactg 


cctgcatgat 


ggcgtatgca 


tgtacatcga 


agctctggac 


240 


aaatacgcat 


gcaactgtgt 


tgtaggttac 


atcggcgaac 


gttgccagta 


tcgcgacctg 


300 


aaatggtggg 


aactgcgtaa 


ggcggccgcc 


cctgcaccga 


acatggagaa 


cacaacatca 


360 


ggattcctag 


gacccctgct 


cgtgttacag 


gcggggtttt 


tcttgttgac 


aagaatcctc 


420 


acaataccac 


agagtctaga 


ctcgtggtgg 


acttctctca 


attttctagg 


gggagcaccc 


480 


acgtgtcctg 


gccaaaattc 


gcagtcccca 


acctccaatc 


actcaccaac 


ctcttgtcct 


540 


ccaatttgtc 


ctggctatcg 


ctggatgtgt 


ctgcggcgtt 


ttatcatatt 


cctcttcatc 


600 


ctgctgctat 


gcctcatctt 


cttgttggtt 


cttctggact 


accaaggtat 


gttgcccgtt 


660 


tgtcctctac 


ttccaggaac 


atcaaccacc 


agcacggggc 


catgcaagac 


ctgcacgatt 


720 


cctgctcgag 


gaacctctat 


gtttccctct 


tgttgctgta 


caaaaccttc 


ggacggaaac 


780 


tgcacttgta 


ttcccatccc 


atcatcctgg 


gctttcgcaa 


gattcctatg 


ggagtgggcc 


840 


tcagtccgtt 


tctcctggct 


cagtttacta 


gtgccatttg 


ttcagtggtt 


cgtagggctt 


900 


tcccccactg 


tttggctttc 


agttatatgg 


atgatgtggt 


attgggggcc 


aagtctgtac 


960 


aacatcttga 


gtcccttttt 


acctctatta 


ccaattttct 


tttgtctttg 


ggtatacatt 


1020 



<210> 90 
<211> 340 
<212> PRT 

<213> artificial sequence 
<220> 
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<223> Proteins corresponding to 50-153 (Q129R) + EGF (serotype y) 
sequence 

<400> 90 

Met Gly Thr Asn Leu Ser val Pro Asn Pro Leu Gly Phe Phe Pro Asp 
1 5 10 15 

His Gin Leu Asp Pro Ala Phe Gly Ala Asn Ser Asn Asn Pro Asp Trp 
20 25 30 

Asp Phe Asn Pro Asn Lys Asp Gin Trp Pro Glu Ala Asn Gin val Gly 
35 40 45 

Ala Gly Gly Arg Met Asn Ser Asp Ser Glu Cys Pro Leu Ser His Asp 
50 55 60 

Gly Tyr Cys Leu His Asp Gly Val Cys Met Tyr lie Glu Ala Leu Asp 
65 70 75 80 

Lys Tyr Ala Cys Asn Cys val val Gly Tyr lie Gly Glu Arg Cys Gin 
85 90 95 

Tyr Arg Asp Leu Lys Trp Trp Glu Leu Arg Lys Ala Ala Ala Pro Ala 
100 105 110 

Pro Asn Met Glu Asn Thr Thr ser Gly Phe Leu Gly Pro Leu Leu Val 
115 120 125 

Leu Gin Ala Gly Phe Phe Leu Leu Thr Arg lie Leu Thr lie Pro Gin 
130 135 ^ 140 

Ser Leu Asp Ser Trp Trp Thr Ser Leu Asn Phe Leu Gly Gly Ala Pro 
145 150 155 160 

Thr Cys Pro Gly Gin Asn Ser Gin Ser Pro Thr ser Asn His Ser Pro 
165 170 175 

Thr Ser Cys Pro Pro lie Cys Pro Gly Tyr Arg Trp Met Cys Leu Arg 
180 185 ~ 190 

Arg Phe lie lie Phe Leu Phe lie Leu Leu Leu Cys Leu lie Phe Leu 
195 200 205 

Leu val Leu Leu Asp Tyr Gin Gly Met Leu Pro val Cys Pro Leu Leu 
210 215 220 

Pro Gly Thr Ser Thr Thr Ser Thr Gly Pro Cys Lys Thr Cys Thr lie 
225 230 235 240 
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Pro Ala Arg Gly Thr Ser Met Phe Pro Ser Cys Cys Cys Thr Lys Pro 
245 250 255 

Ser Asp Gly Asn Cys Thr Cys lie Pro lie Pro Ser Ser Trp Ala Phe 
260 265 270 

Ala Arg Phe Leu Trp Glu Trp Ala Ser Val Arg Phe Ser Trp Leu Ser 
275 280 285 

Leu Leu val Pro Phe Val Gin Trp Phe val Gly Leu Ser Pro Thr val 
290 295 300 

Trp Leu Ser Val lie Trp Met Met Trp Tyr Trp Gly Pro Ser Leu Tyr 
305 310 315 320 

Asn lie Leu Ser Pro Phe Leu Pro Leu Leu Pro lie Phe Phe Cys Leu 
325 330 335 

Trp val Tyr lie 
340 

<210> 91 
<211> 1020 
<212> DNA 

<213> artificial sequence 
<220> 

<223> 50-153 (G145R) + EGF (serotype y) sequence 
<400> 91 



atggggacga 


atctttctgt 


tcccaatcct 


ctgggattct 


ttcccgatca 


ccagttggac 


60 


cctgcgttcg 


gagccaactc 


aaacaatcca 


gattgggact 


tcaaccccaa 


caaggatcaa 


120 


tggccagagg 


caaatcaggt 


aggagcgggc 


ggccgcatga 


actctgattc 


cgaatgcccg 


180 


ctgtctcatg 


acggttactg 


cctgcatgat 


ggcgtatgca 


tgtacatcga 


agctctggac 


240 


aaatacgcat 


gcaactgtgt 


tgtaggttac 


atcggcgaac 


gttgccagta tcgcgacctg 


300 


aaatggtggg 


aactgcgtaa 


ggcggccgcc 


cctgcaccga 


acatggagaa 


cacaacatca 


360 


ggattcctag 


gacccctgct 


cgtgttacag 


gcggggtttt 


tcttgttgac 


aagaatcctc 


420 


acaataccac 


agagtctaga 


ctcgtggtgg 


acttctctca 


attttctagg 


gggagcaccc 


480 


acgtgtcctg 


gccaaaattc 


gcagtcccca 


acctccaatc 


actcaccaac 


ctcttgtcct 


540 


ccaatttgtc 


ctggctatcg 


ctggatgtgt 


ctgcggcgtt 


ttatcatatt 


cctcttcatc 


600 


ctgctgctat 


gcctcatctt 


cttgttggtt 


cttctggact 


accaaggtat 


gttgcccgtt 


660 


tgtcctctac 


ttccaggaac 


atcaaccacc 


agcacggggc 


catgcaagac 


ctgcacgatt 


720 
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cctgctcaag gaacctctat gtttccctct tgttgctgta caaaaccttc ggacagaaac 780 

tgcacttgta ttcccatccc atcatcctgg gctttcgcaa gattcctatg ggagtgggcc 840 

tcagtccgtt tctcctggct cagtttacta gtgccatttg ttcagtggtt cgtagggctt 900 

tcccccactg tttggctttc agttatatgg atgatgtggt attgggggcc aagtctgtac 960 

aacatcttga gtcccttttt acctctatta ccaattttct tttgtctttg ggtatacatt 1020 

<210> 92 

<211> 340 

<212> PRT 

<213> arti f i ci al sequence 
<220> 

<223> Proteins corresponding to 50-153 (G145R) + EGF (serotype y) 
sequence 

<400> 92 

Met Gly Thr Asn Leu Ser val Pro Asn Pro Leu Gly Phe Phe Pro Asp 
15 10 15 

His Gin Leu Asp Pro Ala Phe Gly Ala Asn Ser Asn Asn Pro Asp Trp 
20 25 30 

Asp Phe Asn Pro Asn Lys Asp Gin Trp Pro Glu Ala Asn Gin val Gly 
35 40 45 

Ala Gly Gly Arg Met Asn Ser Asp Ser Glu Cys Pro Leu Ser His Asp 
50 ~ 55 60 

Gly Tyr Cys Leu His Asp Gly Val Cys Met Tyr lie Glu Ala Leu Asp 
65 70 75 80 

Lys Tyr Ala Cys Asn Cys Val val Gly Tyr lie Gly Glu Arg Cys Gin 
85 90 95 

Tyr Arg Asp Leu Lys Trp Trp Glu Leu Arg Lys Ala Ala Ala Pro Ala 
100 105 110 

Pro Asn Met Glu Asn Thr Thr ser Gly Phe Leu Gly Pro Leu Leu Val 
115 120 125 

Leu Gin Ala Gly Phe Phe Leu Leu Thr Arg lie Leu Thr lie Pro Gin 
130 135 140 

Ser Leu Asp Ser Trp Trp Thr Ser Leu Asn Phe Leu Gly Gly Ala Pro 
145 150 155 * 160 

Thr Cys Pro Gly Gin Asn Ser Gin Ser Pro Thr Ser Asn His Ser Pro 
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165 170 175 

Thr ser cys Pro Pro He Cys Pro Gly Tyr Arg Trp Met Cys Leu Arg 
180 185 190 

Arg Phe lie lie Phe Leu Phe lie Leu Leu Leu Cys Leu lie Phe Leu 
195 200 205 

Leu Val Leu Leu Asp Tyr Gin Gly Met Leu Pro Val Cys Pro Leu Leu 
210 215 220 

Pro Gly Thr Ser Thr Thr Ser Thr Gly Pro Cys Lys Thr Cys Thr lie 
225 230 235 240 

Pro Ala Gin Gly Thr Ser Met Phe Pro Ser Cys Cys Cys Thr Lys Pro 
245 250 255 

Ser Asp Arg Asn Cys Thr Cys lie Pro lie Pro Ser Ser Trp Ala Phe 
260 265 270 

Ala Arg Phe Leu Trp Glu Trp Ala Ser Val Arg Phe Ser Trp Leu Ser 
275 280 285 

Leu Leu val Pro Phe val Gin Trp Phe Val Gly Leu Ser Pro Thr val 
290 295 300 

Trp Leu Ser Val lie Trp Met Met Trp Tyr Trp Gly Pro Ser Leu Tyr 
305 310 315 320 

Asn lie Leu Ser Pro Phe Leu Pro Leu Leu Pro lie Phe Phe Cys Leu 
325 330 335 

Trp val Tyr lie 
340 

<210> 93 
<211> 1020 
<212> DNA 

<213> arti f i ci al sequence 
<220> 

<223> 50-153 (Q129R/G145R) + EGF (serotype y) sequence 
<400> 93 

atggggacga atctttctgt tcccaatcct ctgggattct ttcccgatca ccagttggac 60 

cctgcgttcg gagccaactc aaacaatcca gattgggact tcaaccccaa caaggatcaa 120 

tggccagagg caaatcaggt aggagcgggc ggccgcatga actctgattc cgaatgcccg 180 

ctgtctcatg acggttactg cctgcatgat ggcgtatgca tgtacatcga agctctggac 240 
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aaatacgcat 


gcaactgtgt 


tgtaggttac 


atcggcgaac 


gttgccagta tcgcgacctg 


300 


aaatggtggg 


aactgcgtaa 


ggcggccgcc 


cctgcaccga 


acatggagaa cacaacatca 


360 


ggattcctag 


gacccctgct 


cgtgttacag 


gcggggtttt 


tcttgttgac aagaatcctc 


420 


acaataccac 


agagtctaga 


ctcgtggtgg 


acttctctca 


attttctagg 


gggagcaccc 


480 


acgtgtcctg 


gccaaaattc 


gcagtcccca 


acctccaatc 


actcaccaac 


ctcttgtcct 


540 


ccaatttgtc 


ctggctatcg 


ctggatgtgt 


ctgcggcgtt 


ttatcatatt 


cctcttcatc 


600 


ctgctgctat 


gcctcatctt 


cttgttggtt 


cttctggact 


accaaggtat 


gttgcccgtt 


660 


tgtcctctac 


ttccaggaac 


atcaaccacc 


agcacggggc 


catgeaagae 


ctgeacgatt 


720 


rrtnrtrnan 
v_v_ l, y v_ n_yay 




ntttrrrtrt 


Ly l i>yc Ly La 


caaaaccttc 


ggacagaaac 


7RO 
t ou 


tgcacttgta 


ttcccatccc 


atcatcctgg 


getttegcaa 


gattcctatg 


ggagtgggcc 


840 


tcagtccgtt 


tctcctggct 


cagtttacta 


gtgccatttg 


ttcagtggtt 


egtagggett 


900 


tcccccactg 


tttggctttc 


agttatatgg 


atgatgtggt 


attgggggee 


aagtctgtac 


960 


aacatcttga 


gtcccttttt 


acctctatta 


ccaattttct 


tttgtctttg 


ggtatacatt 


1020 



<210> 94 

<211> 340 

<212> PRT 

<213> artificial sequence 
<220> 

<223> Proteins corresponding to 50-153 (Q129R/G145R) + EGF (serotype y) 
sequence 

<400> 94 

Met Gly Thr Asn Leu Ser val Pro Asn Pro Leu Gly Phe Phe Pro Asp 
15 10 15 

His Gin Leu Asp Pro Ala Phe Gly Ala Asn ser Asn Asn Pro Asp Trp 
20 25 30 

Asp Phe Asn Pro Asn Lys Asp Gin Trp Pro Glu Ala Asn Gin Val Gly 
35 40 45 

Ala Gly Gly Arg Met Asn Ser Asp Ser Glu Cys Pro Leu Ser His Asp 
50 55 60 



Gly Tyr Cys Leu His Asp Gly val Cys Met Tyr lie Glu Ala Leu Asp 
65 70 75 80 



Lys Tyr Ala Cys Asn Cys Val val Gly Tyr lie Gly Glu Arg cys Gin 
85 90 95 
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Tyr Arg Asp Leu Lys Trp Trp Glu Leu Arg Lys Ala Ala Ala Pro Ala 
100 105 110 

Pro Asn Met Glu Asn Thr Thr Ser Gly Phe Leu Gly Pro Leu Leu val 
115 120 125 

Leu Gin Ala Gly Phe Phe Leu Leu Thr Arg lie Leu Thr lie Pro Gin 
130 135 ~ 140 

Ser Leu Asp Ser Trp Trp Thr Ser Leu Asn Phe Leu Gly Gly Ala Pro 
145 150 155 160 

Thr Cys Pro Gly Gin Asn Ser Gin Ser Pro Thr Ser Asn His Ser Pro 
165 170 175 

Thr Ser cys Pro Pro lie Cys Pro Gly Tyr Arg Trp Met Cys Leu Arg 
180 185 190 

Arg Phe lie lie Phe Leu Phe lie Leu Leu Leu cys Leu lie Phe Leu 
195 200 205 

Leu val Leu Leu Asp Tyr Gin Gly Met Leu Pro val Cys Pro Leu Leu 
210 215 220 

Pro Gly Thr Ser Thr Thr Ser Thr Gly Pro Cys Lys Thr Cys Thr lie 
225 230 235 240 

Pro Ala Arg Gly Thr Ser Met Phe Pro Ser Cys Cys Cys Thr Lys Pro 
245 250 255 

Ser Asp Arg Asn cys Thr Cys lie Pro lie Pro ser ser Trp Ala Phe 
260 265 270 

Ala Arg Phe Leu Trp Glu Trp Ala Ser val Arg Phe Ser Trp Leu Ser 
275 280 285 

Leu Leu Val Pro Phe val Gin Trp Phe Val Gly Leu Ser Pro Thr val 
290 295 300 

Trp Leu Ser Val lie Trp Met Met Trp Tyr Trp Gly Pro Ser Leu Tyr 
305 310 315 320 

Asn lie Leu Ser Pro Phe Leu Pro Leu Leu Pro lie Phe Phe Cys Leu 
325 330 335 

Trp val Tyr lie 
340 
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<210> 95 

<211> 879 

<212> DNA 

<213> arti f i ci al sequence 
<220> 

<223> 50-153 + STR-tag (serotype y) sequence 



<400> 95 
atggggacga 


atctttctgt 


tcccaatcct 


ctgggattct 


ttcccgatca 


ccagttggac 


60 


cctgcgttcg 


gagccaactc 


aaacaatcca 


gattgggact 


tcaaccccaa 


caaggatcaa 


120 


tggccagagg 


caaatcaggt 


aggagcgggc 


ggccgctgga 


gccacccgca 


gttcgaaaaa 


180 


gcggccgccc 


ctgcaccgaa 


catggagaac 


acaacatcag 


gattcctagg 


acccctgctc 


240 


gtgttacagg 


cggggttttt 


cttgttgaca 


agaatcctca 


caataccaca 


gagtctagac 


300 


tcgtggtgga 


cttctctcaa 


ttttctaggg 


ggagcaccca 


cgtgtcctgg 


ccaaaattcg 


360 


Lay tLLLLad 


LUCLLdaLLd 


r~ -\- r~ ~s r r ry r 
CLCdLCddCC 


lc t tgi.cc LC 


caa u l ug ucc 


tggc La ucgc 




tggatgtgtc 


tgcggcgttt 


tatcatattc 


ctcttcatcc 


tgctgctatg 


cctcatcttc 


480 


ttgttggttc 


ttctggacta 


ccaaggtatg 


ttgcccgttt 


gtcctctact 


tccaggaaca 


540 


tcaaccacca 


gcacggggcc 


atgcaagacc 


tgcacgattc 


ctgctcaagg 


aacctctatg 


600 


tttccctctt 


gttgctgtac 


aaaaccttcg 


gacggaaact 


gcacttgtat 


tcccatccca 


660 


tcatcctggg 


ctttcgcaag 


attcctatgg 


gagtgggcct 


cagtccgttt 


ctcctggctc 


720 


agtttactag 


tgccatttgt 


tcagtggttc 


gtagggcttt 


cccccactgt 


ttggctttca 


780 


gttatatgga 


tgatgtggta 


ttgggggcca 


agtctgtaca 


acatcttgag 


tcccttttta 


840 


cctctattac 


caattttctt 


ttgtctttgg 


gtatacatt 






879 



<210> 96 
<211> 293 
<212> PRT 

<213> artificial sequence 
<220> 

<223> Proteins corresponding to 50-153 + STR-tag (serotype y) sequence 
<400> 96 

Met Gly Thr Asn Leu Ser val Pro Asn Pro Leu Gly Phe Phe Pro Asp 
15 10 15 



His Gln Leu Asp Pro Ala Phe Gly Ala Asn Ser Asn Asn Pro Asp Trp 
20 25 30 

Asp Phe Asn Pro Asn Lys Asp Gln Trp Pro Glu Ala Asn Gln val Gly 
35 40 45 
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Ala Gly Gly Arg Trp Ser His Pro Gin Phe Glu Lys Ala Ala Ala Pro 
50 55 60 

Ala Pro Asn Met Glu Asn Thr Thr Ser Gly Phe Leu Gly Pro Leu Leu 
65 70 75 80 

val Leu Gin Ala Gly Phe Phe Leu Leu Thr Arg lie Leu Thr lie Pro 
85 90 ~ 95 

Gin Ser Leu Asp Ser Trp Trp Thr Ser Leu Asn Phe Leu Gly Gly Ala 
100 105 110 

Pro Thr Cys Pro Gly Gin Asn Ser Gin Ser Pro Thr Ser Asn His Ser 
115 120 125 

pro Thr Ser Cys Pro Pro lie Cys Pro Gly Tyr Arg Trp Met Cys Leu 
130 * 135 140 

Arg Arg Phe He lie Phe Leu Phe lie Leu Leu Leu Cys Leu lie Phe 
145 " 150 155 160 

Leu Leu val Leu Leu Asp Tyr Gin Gly Met Leu Pro Val Cys Pro Leu 
165 170 175 

Leu Pro Gly Thr Ser Thr Thr Ser Thr Gly Pro Cys Lys Thr Cys Thr 
180 185 190 

lie Pro Ala Gin Gly Thr ser Met Phe Pro Ser Cys Cys Cys Thr Lys 
195 200 205 

Pro Ser Asp Gly Asn Cys Thr Cys lie Pro lie Pro Ser Ser Trp Ala 
210 215 220 

Phe Ala Arg Phe Leu Trp Glu Trp Ala Ser val Arg Phe Ser Trp Leu 
225 " 230 235 240 

Ser Leu Leu val Pro Phe val Gin Trp Phe Val Gly Leu Ser Pro Thr 
245 250 255 

val Trp Leu Ser Val lie Trp Met Met Trp Tyr Trp Gly Pro Ser Leu 
260 265 270 

Tyr Asn lie Leu Ser Pro Phe Leu Pro Leu Leu Pro lie Phe Phe Cys 
275 280 285 

Leu Trp val Tyr lie 
290 
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<210> 97 

<211> 879 

<212> DNA 

<213> artificial sequence 
<220> 

<223> 50-153 (Q129R) + STR-tag (serotype y) sequence 



<400> 97 
atggggacga 


atctttctgt 


tcccaatcct 


ctgggattct 


ttcccgatca 


ccagttggac 


60 


cctgcgttcg 


gagccaactc 


aaacaatcca 


gattgggact 


tcaaccccaa 


caaggatcaa 


120 


tggccagagg 


caaatcaggt 


aggagcgggc 


ggccgctgga 


gccacccgca 


gttcgaaaaa 


180 


gcggccgccc 


ctgcaccgaa 


catggagaac 


acaacatcag 


gattcctagg 


acccctgctc 


240 


gtgttacagg 


cggggttttt 


cttgttgaca 


agaatcctca 


caataccaca 


gagtctagac 


300 


tcgtggtgga 


cttctctcaa 


ttttctaggg 


ggagcaccca 


cgtgtcctgg 


ccaaaattcg 


360 


Lay Li_L.n_a.d 


rrtrraatra 




LL. L Ly LLL LL 


raatttntrr 
L.d.a LLLy LLL 


Ly y c LaLLyL 




tggatgtgtc 


tgcggcgttt 


tatcatattc 


ctcttcatcc 


tgctgctatg 


cctcatcttc 


480 


ttgttggttc 


ttctggacta 


ccaaggtatg 


ttgcccgttt 


gtcctctact 


tccaggaaca 


540 


tcaaccacca 


gcacggggcc 


atgcaagacc 


tgcacgattc 


ctgctcgagg 


aacctctatg 


600 


tttccctctt 


gttgctgtac 


aaaaccttcg 


gacggaaact 


gcacttgtat 


tcccatccca 


660 


tcatcctggg 


ctttcgcaag 


attcctatgg 


gagtgggcct 


cagtccgttt 


ctcctggctc 


720 


agtttactag 


tgccatttgt 


tcagtggttc 


gtagggcttt 


cccccactgt 


ttggctttca 


780 


gttatatgga 


tgatgtggta 


ttgggggcca 


agtctgtaca 


acatcttgag 


tcccttttta 


840 


cctctattac 


caattttctt 


ttgtctttgg 


gtatacatt 






879 



<210> 98 

<211> 293 

<212> PRT 

<213> artificial sequence 
<220> 

<223> Proteins corresponding to 50-153 (Q129R) + STR-tag (serotype y) 
sequence 

<400> 98 

Met Gly Thr Asn Leu Ser Val Pro Asn Pro Leu Gly Phe Phe Pro Asp 
15 10 15 

His Gin Leu Asp Pro Ala Phe Gly Ala Asn Ser Asn Asn Pro Asp Trp 
20 25 30 

Asp Phe Asn Pro Asn Lys Asp Gin Trp Pro Glu Ala Asn Gin val Gly 
35 40 45 
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Ala Gly Gly Arg Trp Ser His Pro Gin Phe Glu Lys Ala Ala Ala Pro 
50 55 60 

Ala Pro Asn Met Glu Asn Thr Thr Ser Gly Phe Leu Gly Pro Leu Leu 
65 70 75 80 

val Leu Gin Ala Gly Phe Phe Leu Leu Thr Arg lie Leu Thr lie Pro 
85 90 95 

Gin ser Leu Asp Ser Trp Trp Thr Ser Leu Asn Phe Leu Gly Gly Ala 
100 105 110 

Pro Thr Cys Pro Gly Gin Asn Ser Gin Ser Pro Thr Ser Asn His Ser 
115 120 125 

Pro Thr Ser cys Pro Pro lie Cys Pro Gly Tyr Arg Trp Met Cys Leu 
130 135 140 

Arg Arg Phe He lie Phe Leu Phe lie Leu Leu Leu Cys Leu lie Phe 
145 150 155 160 

Leu Leu val Leu Leu Asp Tyr Gin Gly Met Leu Pro Val Cys Pro Leu 
165 170 175 

Leu Pro Gly Thr Ser Thr Thr Ser Thr Gly Pro Cys Lys Thr Cys Thr 
180 185 190 

lie Pro Ala Arg Gly Thr Ser Met Phe Pro Ser Cys Cys Cys Thr Lys 
195 200 205 

Pro Ser Asp Gly Asn Cys Thr Cys lie Pro lie Pro Ser Ser Trp Ala 
210 215 220 

Phe Ala Arg Phe Leu Trp Glu Trp Ala Ser Val Arg Phe Ser Trp Leu 
225 ~ 230 235 ~ 240 

Ser Leu Leu val Pro Phe Val Gin Trp Phe Val Gly Leu Ser Pro Thr 
245 250 255 

val Trp Leu Ser val lie Trp Met Met Trp Tyr Trp Gly Pro Ser Leu 
260 265 270 

Tyr Asn lie Leu Ser Pro Phe Leu Pro Leu Leu Pro lie Phe Phe Cys 
275 280 285 

Leu Trp Val Tyr lie 
290 

Page 82 



sequence listing_ST25.txt 

<210> 99 

<211> 879 

<212> DNA 

<213> artificial sequence 
<220> 

<223> 50-153 (G145R) + STR-tag (serotype y) sequence 

<400> 99 



atggggacga 


atctttctgt 


tcccaatcct 


ctgggattct 


ttcccgatca 


ccagttggac 


60 


cctgcgttcg 


gagccaactc 


aaacaatcca 


gattgggact 


tcaaccccaa 


caaggatcaa 


120 


tggccagagg 


caaatcaggt 


aggagcgggc 


ggccgctgga 


gccacccgca 


gttcgaaaaa 


180 


gcggccgccc 


ctgcaccgaa 


catggagaac 


acaacatcag 


gattcctagg 


acccctgctc 


240 


gtgttacagg 


cggggttttt 


cttgttgaca 


agaatcctca 


caataccaca 


gagtctagac 


300 




rt*1*rt*rtraa 

V_ L 1 V_ L V. L. V_C*C* 


in it ^-^yyy 




*-y i-y in. iyy 


v. c* ci c* a i_ i_ v- y 


360 


cagtccccaa 


cctccaatca 


ctcaccaacc 


tcttgtcctc 


caatttgtcc 


tggctatcgc 


420 


tggatgtgtc 


tgcggcgttt 


tatcatattc 


ctcttcatcc 


tgctgctatg 


cctcatcttc 


480 


ttgttggttc 


ttctggacta 


ccaaggtatg 


ttgcccgttt 


gtcctctact 


tccaggaaca 


540 


tcaaccacca 


gcacggggcc 


atgcaagacc 


tgcacgattc 


ctgctcaagg 


aacctctatg 


600 


tttccctctt 


gttgctgtac 


aaaaccttcg 


gacagaaact 


gcacttgtat 


tcccatccca 


660 


tcatcctggg 


ctttcgcaag 


attcctatgg 


gagtgggcct 


cagtccgttt 


ctcctggctc 


720 


agtttactag 


tgccatttgt 


tcagtggttc 


gtagggcttt 


cccccactgt 


ttggctttca 


780 


gttatatgga 


tgatgtggta 


ttgggggcca 


agtctgtaca 


acatcttgag 


tcccttttta 


840 


cctctattac 


caattttctt 


ttgtctttgg 


gtatacatt 






879 



<210> 100 

<211> 293 

<212> PRT 

<213> artificial sequence 
<220> 

<223> Proteins corresponding to 50-153 (G145R) + STR-tag (serotype y) 
sequence 

<400> 100 

Met Gly Thr Asn Leu Ser val Pro Asn Pro Leu Gly Phe Phe Pro Asp 
1 5 10 15 

His Gin Leu Asp Pro Ala Phe Gly Ala Asn Ser Asn Asn Pro Asp Trp 
20 25 30 



Asp Phe Asn Pro Asn Lys Asp Gin Trp Pro Glu Ala Asn Gin Val Gly 
35 40 45 
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Ala Gly Gly Arg Trp Ser His Pro Gin Phe Glu Lys Ala Ala Ala Pro 
50 55 60 

Ala Pro Asn Met Glu Asn Thr Thr Ser Gly Phe Leu Gly Pro Leu Leu 
65 70 75 * 80 

Val Leu Gin Ala Gly Phe Phe Leu Leu Thr Arg lie Leu Thr lie Pro 
85 90 ~ 95 

Gin Ser Leu Asp Ser Trp Trp Thr Ser Leu Asn Phe Leu Gly Gly Ala 
100 105 110 

Pro Thr Cys Pro Gly Gin Asn Ser Gin Ser Pro Thr Ser Asn His Ser 
115 120 125 

Pro Thr Ser Cys Pro Pro lie Cys Pro Gly Tyr Arg Trp Met Cys Leu 
130 135 140 

Arg Arg Phe lie lie Phe Leu Phe lie Leu Leu Leu Cys Leu lie Phe 
145 150 155 160 

Leu Leu Val Leu Leu Asp Tyr Gin Gly Met Leu Pro Val Cys Pro Leu 
165 170 175 

Leu Pro Gly Thr Ser Thr Thr Ser Thr Gly Pro Cys Lys Thr Cys Thr 
180 185 190 

lie Pro Ala Gin Gly Thr Ser Met Phe Pro Ser Cys Cys Cys Thr Lys 
195 200 205 

Pro Ser Asp Arg Asn Cys Thr Cys lie Pro lie Pro Ser Ser Trp Ala 
210 215 220 

Phe Ala Arg Phe Leu Trp Glu Trp Ala Ser Val Arg Phe Ser Trp Leu 
225 230 235 ~ 240 

Ser Leu Leu Val Pro Phe val Gin Trp Phe Val Gly Leu Ser Pro Thr 
245 250 255 

val Trp Leu Ser Val lie Trp Met Met Trp Tyr Trp Gly Pro Ser Leu 
260 265 270 

Tyr Asn lie Leu Ser Pro Phe Leu Pro Leu Leu Pro lie Phe Phe Cys 
275 280 285 

Leu Trp Val Tyr lie 
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<210> 101 

<211> 879 

<212> DNA 

<213> artificial sequence 



<220> 

<223> 50-153 (Q129R/G145R) + STR-tag (serotype y) sequence 



<400> 101 
atggggacga 


atctttctgt 


tcccaatcct 


ctgggattct 


ttcccgatca 


ccagttggac 


60 


cctgcgttcg 


gagccaactc 


aaacaatcca 


gattgggact 


tcaaccccaa 


caaggatcaa 


120 


tggccagagg 


caaatcaggt 


aggagcgggc 


ggccgctgga 


gccacccgca 


gttcgaaaaa 


180 


gcggccgccc 


ctgcaccgaa 


catggagaac 


acaacatcag 


gattcctagg 


acccctgctc 


240 


gtgttacagg 


cggggttttt 


cttgttgaca 


agaatcctca 


caataccaca 


gagtctagac 


300 


tcatqqtqqa 


cttctctcaa 


ttttctaaaa 


aaaacaccca 

r3 7"j ^* V* X* V* X* V* 


catatcctaa 


ccaaaattca 


360 


cagtccccaa 


cctccaatca 


ctcaccaacc 


tcttgtcctc 


caatttgtcc 


tggctatcgc 


420 


tggatgtgtc 


tgcggcgttt 


tatcatattc 


ctcttcatcc 


tgctgctatg 


cctcatcttc 


480 


ttgttggttc 


ttctggacta 


ccaaggtatg 


ttgcccgttt 


gtcctctact 


tccaggaaca 


540 


tcaaccacca 


gcacggggcc 


atgcaagacc 


tgcacgattc 


ctgctcgagg 


aacctctatg 


600 


tttccctctt 


gttgctgtac 


aaaaccttcg 


gacagaaact 


gcacttgtat 


tcccatccca 


660 


tcatcctggg 


ctttcgcaag 


attcctatgg 


gagtgggcct 


cagtccgttt 


ctcctggctc 


720 


agtttactag 


tgccatttgt 


tcagtggttc 


gtagggcttt 


cccccactgt 


ttggctttca 


780 


gttatatgga 


tgatgtggta 


ttgggggcca 


agtctgtaca 


acatcttgag 


tcccttttta 


840 


cctctattac 


caattttctt 


ttgtctttgg 


gtatacatt 






879 



<210> 102 

<211> 293 

<212> PRT 

<213> arti f i ci al sequence 
<220> 

<223> Proteins corresponding to 50-153 (Q129R/G145R) + STR-tag 
(serotype y) sequence 

<400> 102 

Met Gly Thr Asn Leu Ser val Pro Asn Pro Leu Gly Phe Phe Pro Asp 
1 5 10 15 



His Gin Leu Asp Pro Ala Phe Gly Ala Asn Ser Asn Asn Pro Asp Trp 
20 25 30 



Asp Phe Asn Pro Asn Lys Asp Gin Trp Pro Glu Ala Asn Gin Val Gly 
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35 40 45 

Ala Gly Gly Arg Trp Ser His Pro Gin Phe Glu Lys Ala Ala Ala Pro 
50 55 60 

Ala Pro Asn Met Glu Asn Thr Thr Ser Gly Phe Leu Gly Pro Leu Leu 
65 70 75 80 

val Leu Gin Ala Gly Phe Phe Leu Leu Thr Arg lie Leu Thr lie Pro 
85 90 * 95 

Gin Ser Leu Asp Ser Trp Trp Thr Ser Leu Asn Phe Leu Gly Gly Ala 
100 105 110 

Pro Thr Cys Pro Gly Gin Asn Ser Gin Ser Pro Thr Ser Asn His Ser 
115 120 125 

Pro Thr Ser Cys Pro Pro lie Cys Pro Gly Tyr Arg Trp Met Cys Leu 
130 135 140 

Arg Arg Phe lie lie Phe Leu Phe lie Leu Leu Leu Cys Leu lie Phe 
145 150 155 160 

Leu Leu Val Leu Leu Asp Tyr Gin Gly Met Leu Pro val Cys Pro Leu 
165 170 175 

Leu Pro Gly Thr Ser Thr Thr Ser Thr Gly Pro Cys Lys Thr Cys Thr 
180 185 190 

lie Pro Ala Arg Gly Thr Ser Met Phe Pro Ser Cys Cys Cys Thr Lys 
195 ~ 200 205 

Pro Ser Asp Arg Asn Cys Thr Cys lie Pro lie Pro Ser Ser Trp Ala 
210 215 220 

Phe Ala Arg Phe Leu Trp Glu Trp Ala ser val Arg Phe Ser Trp Leu 
225 230 235 240 

Ser Leu Leu val Pro Phe val Gin Trp Phe Val Gly Leu Ser Pro Thr 
245 250 255 

val Trp Leu Ser val He Trp Met Met Trp Tyr Trp Gly Pro Ser Leu 
260 265 270 

Tyr Asn lie Leu ser Pro Phe Leu Pro Leu Leu Pro lie Phe Phe Cys 
275 280 285 
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Leu Trp val Tyr lie 
290 
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<210> 103 

<211> 1167 

<212> DNA 

<213> artificial sequence 
<220> 

<223> 32-164 + ZZ (serotype d; claiml2) sequence 



<400> 103 
atgggaggtt 


ggtcttccaa 


acctcggaaa 


ggcatgggga 


cgaatctttc 


tgttcccaat 


60 


cctctgggat 


tctttcccga 


tcaccagttg 


gacggcggcc 


gcgcgcaaca 


cgatgaagcc 


120 


gtagacaaca 


aattcaacaa 


agaacaacaa 


aacgcgttct 


atgagatctt 


acatttacct 


180 


aacttaaacg 


aagaacaacg 


aaacgccttc 


atccaaagtt 


taaaagatga 


cccaagccaa 


240 


agcgctaacc 


ttttagcaga 


agctaaaaag 


ctaaatgatg 


ctcaggcgcc 


gaaagtagac 


300 


aacaaattca 


acaaagaaca 


acaaaacgcg 


ttctatgaga 


tcttacattt 


acctaactta 


360 


aacgaagaac 


aacgaaacgc 


cttcatccaa 


agtttaaaag 


atgacccaag 


ccaaagcgct 


420 


aaccttttag 


cagaagctaa 


aaagctaaat 


gatgctcagg 


cgccgaaagc 


ggccgcccct 


480 


gcaccgaaca 


tggagaacac 


aacatcagga 


ttcctaggac 


ccctgctcgt 


gttacaggcg 


540 


gggtttttct 


tgttgacaag 


aatcctcaca 


ataccacaga 


gtctagactc 


gtggtggact 


600 


tctctcaatt 


ttctaggggg 


agcacccacg 


tgtcctggcc 


aaaattcgca 


gtccccaacc 


660 


tccaatcact 


caccaacctc 


ttgtcctcca 


atttgtcctg 


gctatcgctg 


gatgtgtctg 


720 


cggcgtttta 


tcatattcct 


cttcatcctg 


ctgctatgcc 


tcatcttctt 


gttggttctt 


780 


ctggactacc 


aaggtatgtt 


gcccgtttgt 


cctctacttc 


caggaacatc 


aaccaccagc 


840 


acggggccat 


gcaagacctg 


cacgattcct 


gctcaaggaa 


cctctatgtt 


tccctcttgt 


900 


tgctgtacaa 


aaccttcgga 


cggaaactgc 


acttgtattc 


ccatcccatc 


atcctgggct 


960 


ttcgcaagat 


tcctatggga 


gtgggcctca 


gtccgtttct 


cctggctcag 


tttactagtg 


1020 


ccatttgttc 


agtggttcgt 


agggctttcc 


cccactgttt 


ggctttcagt 


tatatggatg 


1080 


atgtggtatt 


gggggccaag 


tctgtacaac 


atcttgagtc 


cctttttacc 


tctattacca 


1140 


attttctttt 


gtctttgggt 


atacatt 








1167 



<210> 104 
<211> 389 
<212> PRT 

<213> artificial sequence 
<220> 

<223> Proteins corresponding to 32-164 + ZZ (serotype d; claiml2) 
sequence 
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<400> 104 

Met Gly Gly Trp ser Ser Lys Pro Arg Lys Gly Met Gly Thr Asn Leu 
15 10 15 

Ser val Pro Asn Pro Leu Gly Phe Phe Pro Asp His Gin Leu Asp Gly 
20 25 30 

Gly Arg Ala Gin His Asp Glu Ala Val Asp Asn Lys Phe Asn Lys Glu 
35 40 45 

Gin Gin Asn Ala Phe Tyr Glu lie Leu His Leu Pro Asn Leu Asn Glu 
50 55 60 

Glu Gin Arg Asn Ala Phe lie Gin Ser Leu Lys Asp Asp Pro Ser Gin 
65 " 70 75 80 

Ser Ala Asn Leu Leu Ala Glu Ala Lys Lys Leu Asn Asp Ala Gin Ala 
85 90 95 

Pro Lys Val Asp Asn Lys Phe Asn Lys Glu Gin Gin Asn Ala Phe Tyr 
100 105 110 

Glu lie Leu His Leu Pro Asn Leu Asn Glu Glu Gin Arg Asn Ala Phe 
115 120 125 

lie Gin Ser Leu Lys Asp Asp Pro Ser Gin ser Ala Asn Leu Leu Ala 
130 135 140 

Glu Ala Lys Lys Leu Asn Asp Ala Gin Ala Pro Lys Ala Ala Ala Pro 
145 150 155 160 

Ala Pro Asn Met Glu Asn Thr Thr Ser Gly Phe Leu Gly Pro Leu Leu 
165 170 175 

val Leu Gin Ala Gly Phe Phe Leu Leu Thr Arg lie Leu Thr lie Pro 
180 185 " 190 

Gin Ser Leu Asp Ser Trp Trp Thr Ser Leu Asn Phe Leu Gly Gly Ala 
195 200 205 

Pro Thr Cys Pro Gly Gin Asn Ser Gin Ser Pro Thr ser Asn His Ser 
210 ' 215 220 

Pro Thr ser cys Pro Pro lie Cys Pro Gly Tyr Arg Trp Met Cys Leu 
225 230 235 240 

Arg Arg Phe lie lie Phe Leu Phe lie Leu Leu Leu Cys Leu lie Phe 
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245 250 255 

Leu Leu val Leu Leu Asp Tyr Gin Gly Met Leu Pro val Cys Pro Leu 
260 265 270 

Leu Pro Gly Thr Ser Thr Thr Ser Thr Gly Pro Cys Lys Thr Cys Thr 
275 280 285 

lie Pro Ala Gin Gly Thr ser Met Phe Pro Ser Cys Cys Cys Thr Lys 
290 295 300 

Pro Ser Asp Gly Asn Cys Thr Cys lie Pro lie Pro Ser Ser Trp Ala 
305 310 315 320 

Phe Ala Arg Phe Leu Trp Glu Trp Ala ser Val Arg Phe Ser Trp Leu 
325 330 335 

Ser Leu Leu val Pro Phe val Gin Trp Phe val Gly Leu ser Pro Thr 
340 345 350 

val Trp Leu Ser val lie Trp Met Met Trp Tyr Trp Gly Pro Ser Leu 
355 360 365 

Tyr Asn lie Leu Ser Pro Phe Leu Pro Leu Leu Pro lie Phe Phe Cys 
370 375 380 

Leu Trp val Tyr lie 
385 



<210> 105 
<211> 1167 
<212> DNA 

<213> artificial sequence 










<220> 

<223> 32-164 (Q129R) 


+ zz (serotype d) sequence 






<400> 105 

atgggaggtt ggtcttccaa 


acctcggaaa 


ggcatgggga 


cgaatctttc 


tgttcccaat 


60 


cctctgggat tctttcccga 


tcaccagttg 


gacggcggcc 


gcgcgcaaca 


cgatgaagcc 


120 


gtagacaaca aattcaacaa 


agaacaacaa 


aacgcgttct 


atgagatctt 


acatttacct 


180 


aacttaaacg aagaacaacg 


aaacgccttc 


atccaaagtt 


taaaagatga 


cccaagccaa 


240 


agcgctaacc ttttagcaga 


agctaaaaag 


ctaaatgatg 


ctcaggcgcc 


gaaagtagac 


300 


aacaaattca acaaagaaca 


acaaaacgcg 


ttctatgaga 


tcttacattt 


acctaactta 


360 


aacgaagaac aacgaaacgc 


cttcatccaa 


agtttaaaag 


atgacccaag 


ccaaagcgct 


420 


aaccttttag cagaagctaa 


aaagctaaat 


gatgctcagg 
Page 


cgccgaaagc 
89 


ggccgcccct 


480 



sequence listing_ST25.txt 



gcaccgaaca 


tggagaacac 


aacatcagga 


ttcctaggac 


ccctgctcgt 


gttacaggcg 


540 


gggtttttct 


tgttgacaag 


aatcctcaca 


ataccacaga 


gtctagactc 


gtggtggact 


600 


tctctcaatt 


ttctaggggg 


agcacccacg 


tgtcctggcc 


aaaattcgca 


gtccccaacc 


660 


tccaatcact 


caccaacctc 


ttgtcctcca 


atttgtcctg 


gctatcgctg 


gatgtgtctg 


720 


cggcgtttta 


tcatattcct 


cttcatcctg 


ctgctatgcc 


tcatcttctt 


gttggttctt 


780 


ctggactacc 


aaggtatgtt 


gcccgtttgt 


cctctacttc 


caggaacatc 


aaccaccagc 


840 


acggggccat 


gcaagacctg 


cacgattcct 


gctcgaggaa 


cctctatgtt 


tccctcttgt 


900 




aaLLLLtyyd 


(.yyd.adc.Lyu 


dLLLy LdLtL 


LLdLLLLdL L 






ttcgcaagat 


tcctatggga 


gtgggcctca 


gtccgtttct 


cctggctcag 


tttactagtg 


1020 


ccatttgttc 


agtggttcgt 


agggctttcc 


cccactgttt 


ggctttcagt 


tatatggatg 


1080 


atgtggtatt 


gggggccaag 


tctgtacaac 


atcttgagtc 


cctttttacc 


tctattacca 


1140 


attttctttt 


gtctttgggt 


atacatt 








1167 



<210> 106 

<211> 388 

<212> PRT 

<213> artificial sequence 
<220> 

<223> Proteins corresponding to 32-164 (Q129R) + ZZ (serotype d) 
sequence 

<400> 106 

Met Gly Gly Trp Ser ser Lys Pro Arg Lys Gly Met Gly Thr Asn Leu 
15 10 15 

Ser Val Pro Asn Pro Leu Gly Phe Phe Pro Asp His Gin Leu Asp Gly 
20 25 30 

Gly Arg Ala Gin His Asp Glu Ala val Asp Asn Lys Phe Asn Lys Glu 
35 40 45 

Gin Gin Asn Ala Phe Tyr Glu lie Leu His Leu Pro Asn Leu Asn Glu 
50 55 60 

Glu Gin Arg Asn Ala Phe lie Gin Ser Leu Lys Asp Asp Pro Ser Gin 
65 70 75 80 

Ser Ala Asn Leu Leu Ala Glu Ala Lys Lys Leu Asn Asp Ala Gin Ala 
85 90 95 

Pro Lys val Asp Asn Lys Phe Asn Lys Glu Gin Gin Asn Ala Phe Tyr 
100 105 110 
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Glu lie Leu His Leu Pro Asn Leu Asn Glu Glu Gin Arg Asn Ala Phe 

115 120 125 

lie Gin Ser Leu Lys Asp Asp Pro Ser Gin Ser Ala Asn Leu Leu Ala 

130 135 140 

Glu Ala Lys Lys Leu Asn Asp Ala Gin Ala Pro Lys Ala Ala Ala Pro 

145 150 155 160 

Ala Pro Asn Met Glu Asn Thr Thr Ser Gly Phe Leu Gly Pro Leu Leu 

165 170 175 

Val Leu Gin Ala Gly Phe Phe Leu Leu Thr Arg lie Leu Thr lie Pro 

180 185 190 

Gin Ser Leu Asp Ser Trp Trp Thr Ser Leu Asn Phe Leu Gly Gly Ala 

195 200 205 

Pro Thr Cys Pro Gly Gin Asn Ser Gin Ser Pro Thr Ser Asn His Ser 

210 215 220 

Pro Thr Ser Cys Pro Pro lie Cys Pro Gly Tyr Arg Trp Met Cys Leu 

225 230 235 ~ 240 

Arg Arg Phe lie lie Phe Leu Phe lie Leu Leu Leu Cys Leu lie Phe 

245 250 255 

Leu Leu Val Leu Leu Asp Tyr Gin Gly Met Leu Pro Val Cys Pro Leu 

260 265 270 

Leu Pro Gly Thr Ser Thr Thr Ser Thr Gly Pro Cys Lys Thr Cys Thr 

275 280 285 

lie Pro Ala Arg Gly Thr ser Met Phe Pro Ser Cys Cys Cys Thr Lys 

290 295 300 

Pro Ser Asp Gly Asn Cys Thr Cys lie Pro lie Pro Ser Ser Trp Ala 

305 310 315 320 

Phe Ala Arg Phe Leu Trp Glu Trp Ala Ser val Arg Phe Ser Trp Leu 

325 330 ~ 335 

Ser Leu Leu Val Pro Phe Val Gin Trp Phe val Gly Leu Ser Pro Thr 

340 345 350 

val Trp Leu Ser Val lie Trp Met Met Trp Tyr Trp Gly Pro Ser Leu 
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360 365 



Tyr Asn He Leu Ser Pro Phe Leu Pro Leu Leu Pro He Phe Phe Cys 
370 375 380 

Leu Trp Val Tyr 
385 

<210> 107 

<211> 1167 

<212> DNA 

<213> artificial sequence 
<220> 

<223> 32-164 (G145R) + ZZ (serotype d) sequence 



<400> 107 



atgggaggtt 


ggtcttccaa 


acctcggaaa 


ggcatgggga 


cgaatctttc 


tgttcccaat 


60 


cctctgggat 


tctttcccga 


tcaccagttg 


gacggcggcc 


gcgcgcaaca 


cgatgaagcc 


120 


gtagacaaca 


aattcaacaa 


agaacaacaa 


aacgcgttct 


atgagatctt 


acatttacct 


180 


aacttaaacg 


aagaacaacg 


aaacgccttc 


atccaaagtt 


taaaagatga 


cccaagccaa 


240 


agcgctaacc 


ttttagcaga 


agctaaaaag 


ctaaatgatg 


ctcaggcgcc 


gaaagtagac 


300 


aacaaattca 


acaaagaaca 


acaaaacgcg 


ttctatgaga 


tcttacattt 


acctaactta 


360 


aacgaagaac 


aacgaaacgc 


cttcatccaa 


agtttaaaag 


atgacccaag 


ccaaagcgct 


420 


aaccttttag 


cagaagctaa 


aaagctaaat 


gatgctcagg 


cgccgaaagc 


ggccgcccct 


480 


gcaccgaaca 


tggagaacac 


aacatcagga 


ttcctaggac 


ccctgctcgt 


gttacaggcg 


540 


gggtttttct 


tgttgacaag 


aatcctcaca 


ataccacaga 


gtctagactc 


gtggtggact 


600 


tctctcaatt 


ttctaggggg 


agcacccacg 


tgtcctggcc 


aaaattcgca 


gtccccaacc 


660 


tccaatcact 


caccaacctc 


ttgtcctcca 


atttgtcctg 


gctatcgctg 


gatgtgtctg 


720 


cggcgtttta 


tcatattcct 


cttcatcctg 


ctgctatgcc 


tcatcttctt 


gttggttctt 


780 


ctggactacc 


aaggtatgtt 


gcccgtttgt 


cctctacttc 


caggaacatc 


aaccaccagc 


840 


acggggccat 


gcaagacctg 


cacgattcct 


gctcaaggaa 


cctctatgtt 


tccctcttgt 


900 


tgctgtacaa 


aaccttcgga 


cagaaactgc 


acttgtattc 


ccatcccatc 


atcctgggct 


960 


ttcgcaagat 


tcctatggga 


gtgggcctca 


gtccgtttct 


cctggctcag 


tttactagtg 


1020 


ccatttgttc 


agtggttcgt 


agggctttcc 


cccactgttt 


ggctttcagt 


tatatggatg 


1080 


atgtggtatt 


gggggccaag 


tctgtacaac 


atcttgagtc 


cctttttacc 


tctattacca 


1140 


attttctttt 


gtctttgggt 


atacatt 








1167 



<210> 108 
<211> 389 
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<212> PRT 

<213> artificial sequence 
<220> 

<223> Proteins corresponding to 32-164 (G145R) + ZZ (serotype d) 
sequence 

<400> 108 

Met Gly Gly Trp Ser Ser Lys Pro Arg Lys Gly Met Gly Thr Asn Leu 
1 5 10 15 

Ser Val Pro Asn Pro Leu Gly Phe Phe Pro Asp His Gin Leu Asp Gly 
20 25 30 

Gly Arg Ala Gin His Asp Glu Ala val Asp Asn Lys Phe Asn Lys Glu 
35 40 45 

Gin Gin Asn Ala Phe Tyr Glu lie Leu His Leu Pro Asn Leu Asn Glu 
50 55 60 

Glu Gin Arg Asn Ala Phe lie Gin Ser Leu Lys Asp Asp Pro Ser Gin 
65 70 75 80 

Ser Ala Asn Leu Leu Ala Glu Ala Lys Lys Leu Asn Asp Ala Gin Ala 
85 90 95 

Pro Lys Val Asp Asn Lys Phe Asn Lys Glu Gin Gin Asn Ala Phe Tyr 
100 105 110 

Glu lie Leu His Leu Pro Asn Leu Asn Glu Glu Gin Arg Asn Ala Phe 
115 120 125 

lie Gin ser Leu Lys Asp Asp Pro Ser Gin ser Ala Asn Leu Leu Ala 
130 135 140 

Glu Ala Lys Lys Leu Asn Asp Ala Gin Ala Pro Lys Ala Ala Ala Pro 
145 * ' 150 155 160 

Ala Pro Asn Met Glu Asn Thr Thr Ser Gly Phe Leu Gly Pro Leu Leu 
165 170 175 

Val Leu Gin Ala Gly Phe Phe Leu Leu Thr Arg lie Leu Thr lie Pro 
180 185 190 

Gin ser Leu Asp Ser Trp Trp Thr Ser Leu Asn Phe Leu Gly Gly Ala 
195 200 205 

Pro Thr cys Pro Gly Gin Asn Ser Gin Ser Pro Thr ser Asn His Ser 
210 215 220 
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Pro Thr Ser Cys Pro Pro lie Cys Pro Gly Tyr Arg Trp Met Cys Leu 
225 230 235 240 

Arg Arg Phe lie lie Phe Leu Phe lie Leu Leu Leu Cys Leu lie Phe 
245 250 255 

Leu Leu val Leu Leu Asp Tyr Gin Gly Met Leu Pro val Cys Pro Leu 
260 265 270 

Leu Pro Gly Thr ser Thr Thr Ser Thr Gly Pro Cys Lys Thr Cys Thr 
275 280 285 

lie Pro Ala Gin Gly Thr Ser Met Phe Pro Ser Cys Cys Cys Thr Lys 
290 295 300 

Pro Ser Asp Arg Asn Cys Thr cys lie Pro lie Pro Ser Ser Trp Ala 
305 310 315 320 

Phe Ala Arg Phe Leu Trp Glu Trp Ala Ser val Arg Phe Ser Trp Leu 
325 330 ^ 335 

Ser Leu Leu val Pro Phe val Gin Trp Phe val Gly Leu Ser Pro Thr 
340 345 350 

Val Trp Leu Ser Val lie Trp Met Met Trp Tyr Trp Gly Pro Ser Leu 
355 360 365 

Tyr Asn lie Leu Ser Pro Phe Leu Pro Leu Leu Pro lie Phe Phe Cys 
370 375 380 

Leu Trp val Tyr lie 
385 

<210> 109 
<211> 1167 
<212> DNA 

<213> artificial sequence 
<220> 

<223> 32-164 (Q129R/G145R) + zz-tag (serotype d) sequence 
<400> 109 

atgggaggtt ggtcttccaa acctcggaaa ggcatgggga cgaatctttc tgttcccaat 60 
cctctgggat tctttcccga tcaccagttg gacggcggcc gcgcgcaaca cgatgaagcc 120 
gtagacaaca aattcaacaa agaacaacaa aacgcgttct atgagatctt acatttacct 180 
aacttaaacg aagaacaacg aaacgccttc atccaaagtt taaaagatga cccaagccaa 240 
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agcgctaacc 


ttttagcaga 
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agctaaaaag ctaaatgatg ctcaggcgcc gaaagtagac 


300 


aacaaattca 


acaaagaaca 


acaaaacgcg 


ttctatgaga 


tcttacattt 


acctaactta 


360 


aacgaagaac 


aacgaaacgc 


cttcatccaa 


agtttaaaag 


atgacccaag 


ccaaagcgct 


420 


aaccttttag 


cagaagctaa 


aaagctaaat 


gatgctcagg 


cgccgaaagc 


ggccgcccct 


480 


gcaccgaaca 


tggagaacac 


aacatcagga 


ttcctaggac 


ccctgctcgt 


gttacaggcg 


540 


gggtttttct 


tgttgacaag 


aatcctcaca 


ataccacaga 


gtctagactc 


gtggtggact 


600 


tctctcaatt 


ttctaggggg 


agcacccacg 


tgtcctggcc 


aaaattcgca 


gtccccaacc 


660 


tccaatcact 


caccaacctc 


ttgtcctcca 


atttgtcctg 


gctatcgctg gatgtgtctg 


720 


cggcgtttta 


tcatattcct 


cttcatcctg 


ctgctatgcc 


tcatcttctt gttggttctt 


780 


ctggactacc 


aaggtatgtt 


gcccgtttgt 


cctctacttc 


caggaacatc 


aaccaccagc 


840 


acggggccat 


gcaagacctg 


cacgattcct 


gctcgaggaa 


cctctatgtt 


tccctcttgt 


900 


tgctgtacaa 


aaccttcgga 


cagaaactgc 


acttgtattc 


ccatcccatc 


atcctgggct 


960 


ttrnraanat 




gtgggcctca 


gtccgtttct 


cctggctcag 


tttactagtg 


1070 


ccatttgttc 


agtggttcgt 


agggctttcc 


cccactgttt 


ggctttcagt 


tatatggatg 


1080 


atgtggtatt 


gggggccaag 


tctgtacaac 


atcttgagtc 


cctttttacc 


tctattacca 


1140 


attttctttt 


gtctttgggt 


atacatt 








1167 



<210> 110 

<211> 389 

<212> PRT 

<213> artificial sequence 
<220> 

<223> Proteins corresponding to 32-164 (Q129R/G145R) + zz-tag (serotype 
d) sequence 

<400> 110 

Met Gly Gly Trp Ser Ser Lys Pro Arg Lys Gly Met Gly Thr Asn Leu 
1 * 5 10 15 

Ser Val Pro Asn Pro Leu Gly Phe Phe Pro Asp His Gin Leu Asp Gly 
20 25 30 



Gly Arg Ala Gin His Asp Glu Ala Val Asp Asn Lys Phe Asn Lys Glu 
35 40 45 



Gin Gin Asn Ala Phe Tyr Glu lie Leu His Leu Pro Asn Leu Asn Glu 
50 55 60 



Glu Gin Arg Asn Ala Phe lie Gin Ser Leu Lys Asp Asp Pro Ser Gin 
65 ~ 70 75 80 
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Ser Ala Asn Leu Leu Ala Glu Ala Lys Lys Leu Asn Asp Ala Gin Ala 
85 90 95 

Pro Lys Val Asp Asn Lys Phe Asn Lys Glu Gin Gin Asn Ala Phe Tyr 
100 105 110 

Glu lie Leu His Leu Pro Asn Leu Asn Glu Glu Gin Arg Asn Ala Phe 
115 120 125 

lie Gin ser Leu Lys Asp Asp Pro Ser Gin Ser Ala Asn Leu Leu Ala 
130 135 140 

Glu Ala Lys Lys Leu Asn Asp Ala Gin Ala Pro Lys Ala Ala Ala Pro 
145 150 155 160 

Ala Pro Asn Met Glu Asn Thr Thr Ser Gly Phe Leu Gly Pro Leu Leu 
165 170 175 

val Leu Gin Ala Gly Phe Phe Leu Leu Thr Arg lie Leu Thr lie Pro 
180 185 190 

Gin Ser Leu Asp Ser Trp Trp Thr ser Leu Asn Phe Leu Gly Gly Ala 
195 200 205 

Pro Thr Cys Pro Gly Gin Asn ser Gin ser Pro Thr Ser Asn His Ser 
210 215 220 

Pro Thr ser Cys Pro Pro lie Cys Pro Gly Tyr Arg Trp Met Cys Leu 
225 230 235 240 

Arg Arg Phe lie lie Phe Leu Phe lie Leu Leu Leu Cys Leu lie Phe 
245 250 ' 255 

Leu Leu val Leu Leu Asp Tyr Gin Gly Met Leu Pro val Cys Pro Leu 
260 265 270 

Leu Pro Gly Thr ser Thr Thr Ser Thr Gly Pro Cys Lys Thr Cys Thr 
275 280 285 

lie Pro Ala Arg Gly Thr Ser Met Phe Pro Ser Cys Cys Cys Thr Lys 
290 295 300 

Pro Ser Asp Arg Asn cys Thr cys lie Pro lie Pro Ser Ser Trp Ala 
305 ~ 310 315 320 

Phe Ala Arg Phe Leu Trp Glu Trp Ala Ser Val Arg Phe Ser Trp Leu 
325 330 335 
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ser Leu Leu val Pro Phe val Gin Trp Phe val Gly Leu Ser Pro Thr 
340 345 350 

Val Trp Leu Ser val lie Trp Met Met Trp Tyr Trp Gly Pro Ser Leu 
355 360 365 

Tyr Asn lie Leu Ser Pro Phe Leu Pro Leu Leu Pro lie Phe Phe Cys 
370 375 380 

Leu Trp Val Tyr lie 
385 

<210> 111 

<211> 966 

<212> DNA 

<213> artificial sequence 
<220> 

<223> 32-164 + EGF (serotype d) sequence 



<400> 111 



atgggaggtt 


ggtcttccaa 


acctcggaaa 


ggcatgggga 


cgaatctttc 


tgttcccaat 


60 


cere xgggat 


tctttcccga 


tcaccagttg 


gacggcggcc 


gcatgaactc 


tgattccgaa 




tgcccgctgt 


ctcatgacgg 


ttactgcctg 


catgatggcg 


tatgcatgta 


catcgaagct 


180 


ctggacaaat 


acgcatgcaa 


ctgtgttgta 


ggttacatcg 


gcgaacgttg 


ccagtatcgc 


240 


gacctgaaat 


ggtgggaact 


gcgtaaggcg 


gccgcccctg 


caccgaacat 


ggagaacaca 


300 


acatcaggat 


tcctaggacc 


cctgctcgtg 


ttacaggcgg 


ggtttttctt 


gttgacaaga 


360 


atcctcacaa 


taccacagag 


tctagactcg 


tggtggactt 


ctctcaattt 


tctaggggga 


420 


gcacccacgt 


gtcctggcca 


aaattcgcag 


tccccaacct 


ccaatcactc 


accaacctct 


480 


tgtcctccaa 


tttgtcctgg 


ctatcgctgg 


atgtgtctgc 


ggcgttttat 


catattcctc 


540 


ttcatcctgc 


tgctatgcct 


catcttcttg 


ttggttcttc 


tggactacca 


aggtatgttg 


600 


cccgtttgtc 


ctctacttcc 


aggaacatca 


accaccagca 


cggggccatg 


caagacctgc 


660 


acgattcctg 


ctcaaggaac 


ctctatgttt 


ccctcttgtt 


gctgtacaaa 


accttcggac 


720 


ggaaactgca 


cttgtattcc 


catcccatca 


tcctgggctt 


tcgcaagatt 


cctatgggag 


780 


tgggcctcag 


tccgtttctc 


ctggctcagt 


ttactagtgc 


catttgttca 


gtggttcgta 


840 


gggctttccc 


ccactgtttg 


gctttcagtt 


atatggatga 


tgtggtattg 


ggggccaagt 


900 


ctgtacaaca 


tcttgagtcc 


ctttttacct 


ctattaccaa 


ttttcttttg 


tctttgggta 


960 


tacatt 












966 


<210> 112 
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<211> 322 
<212> PRT 

<213> artificial sequence 
<220> 

<223> Proteins corresponding to 32-164 + EGF (serotype d) sequence 
<400> 112 

Met Gly Gly Trp Ser Ser Lys Pro Arg Lys Gly Met Gly Thr Asn Leu 
1 5 10 15 

Ser val Pro Asn Pro Leu Gly Phe Phe Pro Asp His Gin Leu Asp Gly 
20 25 30 

Gly Arg Met Asn Ser Asp Ser Glu Cys Pro Leu Ser His Asp Gly Tyr 
35 40 45 

Cys Leu His Asp Gly Val Cys Met Tyr lie Glu Ala Leu Asp Lys Tyr 
50 55 60 

Ala Cys Asn Cys Val Val Gly Tyr lie Gly Glu Arg Cys Gin Tyr Arg 
65 70 ' 75 80 

Asp Leu Lys Trp Trp Glu Leu Arg Lys Ala Ala Ala Pro Ala Pro Asn 
85 90 95 

Met Glu Asn Thr Thr Ser Gly Phe Leu Gly Pro Leu Leu Val Leu Gin 
100 105 110 

Ala Gly Phe Phe Leu Leu Thr Arg lie Leu Thr lie Pro Gin Ser Leu 
115 120 125 

Asp Ser Trp Trp Thr Ser Leu Asn Phe Leu Gly Gly Ala Pro Thr Cys 
130 135 140 

Pro Gly Gin Asn Ser Gin Ser Pro Thr Ser Asn His Ser Pro Thr ser 
145 150 155 160 

Cys Pro Pro lie Cys Pro Gly Tyr Arg Trp Met Cys Leu Arg Arg Phe 
165 170 175 

lie lie Phe Leu Phe lie Leu Leu Leu Cys Leu lie Phe Leu Leu val 
180 185 190 

Leu Leu Asp Tyr Gin Gly Met Leu Pro val Cys Pro Leu Leu Pro Gly 
195 200 205 

Thr Ser Thr Thr Ser Thr Gly Pro Cys Lys Thr Cys Thr lie Pro Ala 
210 215 220 

Page 98 



sequence listing_ST25.txt 

Gin Gly Thr Ser Met Phe Pro Ser Cys Cys Cys Thr Lys Pro ser Asp 
225 230 235 240 

Gly Asn Cys Thr Cys He Pro He Pro Ser Ser Trp Ala Phe Ala Arg 
245 250 255 

Phe Leu Trp Glu Trp Ala Ser Val Arg Phe Ser Trp Leu Ser Leu Leu 
260 265 270 

val Pro Phe val Gin Trp Phe val Gly Leu Ser Pro Thr val Trp Leu 
275 280 285 

Ser val lie Trp Met Met Trp Tyr Trp Gly Pro Ser Leu Tyr Asn lie 
290 295 300 

Leu Ser Pro Phe Leu Pro Leu Leu Pro lie Phe Phe Cys Leu Trp val 
305 310 315 320 

Tyr lie 

<210> 113 

<211> 966 

<212> DNA 

<213> artificial sequence 



<220> 

<223> 32-164 (Q129R) + EGF (serotype d) sequence 
<400> 113 



atgggaggtt 


ggtcttccaa 


acctcggaaa 


ggcatgggga 


cgaatctttc tgttcccaat 


60 


cctctgggat 


tctttcccga 


tcaccagttg 


gacggcggcc 


gcatgaactc tgattccgaa 


120 


tgcccgctgt 


ctcatgacgg 


ttactgcctg 


catgatggcg 


tatgcatgta 


catcgaagct 


180 


ctggacaaat 


acgcatgcaa 


ctgtgttgta 


ggttacatcg 


gcgaacgttg 


ccagtatcgc 


240 


gacctgaaat 


ggtgggaact 


gcgtaaggcg 


gccgcccctg 


caccgaacat 


ggagaacaca 


300 


acatcaggat 


tcctaggacc 


cctgctcgtg 


ttacaggcgg 


ggtttttctt 


gttgacaaga 


360 


atcctcacaa 


taccacagag 


tctagactcg 


tggtggactt 


ctctcaattt 


tctaggggga 


420 


gcacccacgt 


gtcctggcca 


aaattcgcag 


tccccaacct 


ccaatcactc 


accaacctct 


480 


tgtcctccaa 


tttgtcctgg 


ctatcgctgg 


atgtgtctgc 


ggcgttttat 


catattcctc 


540 


ttcatcctgc 


tgctatgcct 


catcttcttg 


ttggttcttc 


tggactacca 


aggtatgttg 


600 


cccgtttgtc 


ctctacttcc 


aggaacatca 


accaccagca 


cggggccatg 


caagacctgc 


660 


acgattcctg 


ctcgaggaac 


ctctatgttt 


ccctcttgtt 


gctgtacaaa 


accttcggac 


720 
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ggaaactgca cttgtattcc catcccatca tcctgggctt tcgcaagatt cctatgggag 780 

tgggcctcag tccgtttctc ctggctcagt ttactagtgc catttgttca gtggttcgta 840 

gggctttccc ccactgtttg gctttcagtt atatggatga tgtggtattg ggggccaagt 900 

ctgtacaaca tcttgagtcc ctttttacct ctattaccaa ttttcttttg tctttgggta 960 

tacatt 966 

<210> 114 

<211> 322 

<212> PRT 

<213> artificial sequence 
<220> 

<223> Proteins corresponding to 32-164 (Q129R) + EGF (serotype d) 
sequence 

<400> 114 

Met Gly Gly Trp Ser Ser Lys Pro Arg Lys Gly Met Gly Thr Asn Leu 
1 5 10 15 

Ser val Pro Asn Pro Leu Gly Phe Phe Pro Asp His Gin Leu Asp Gly 
20 25 30 

Gly Arg Met Asn Ser Asp Ser Glu Cys Pro Leu Ser His Asp Gly Tyr 
35 40 45 

Cys Leu His Asp Gly Val Cys Met Tyr lie Glu Ala Leu Asp Lys Tyr 
50 55 60 

Ala Cys Asn Cys val val Gly Tyr lie Gly Glu Arg Cys Gin Tyr Arg 
65 70 75 80 

Asp Leu Lys Trp Trp Glu Leu Arg Lys Ala Ala Ala Pro Ala Pro Asn 
85 90 95 

Met Glu Asn Thr Thr Ser Gly Phe Leu Gly Pro Leu Leu Val Leu Gin 
100 105 110 

Ala Gly Phe Phe Leu Leu Thr Arg lie Leu Thr lie Pro Gin Ser Leu 
115 120 125 

Asp Ser Trp Trp Thr Ser Leu Asn Phe Leu Gly Gly Ala Pro Thr Cys 
130 135 140 

Pro Gly Gin Asn Ser Gin Ser Pro Thr Ser Asn His Ser Pro Thr Ser 
145 150 155 160 

Cys Pro Pro lie Cys Pro Gly Tyr Arg Trp Met Cys Leu Arg Arg Phe 

Page 100 



sequence listing_ST25.txt 
165 170 175 

lie lie Phe Leu Phe lie Leu Leu Leu Cys Leu lie Phe Leu Leu val 
180 185 190 

Leu Leu Asp Tyr Gin Gly Met Leu Pro Val Cys Pro Leu Leu Pro Gly 
195 200 205 

Thr Ser Thr Thr Ser Thr Gly Pro Cys Lys Thr Cys Thr lie Pro Ala 
210 215 220 



Arg Gly Thr Ser Met Phe Pro Ser Cys Cys Cys Thr Lys Pro Ser Asp 
225 230 235 240 

Gly Asn Cys Thr Cys lie Pro lie Pro Ser Ser Trp Ala Phe Ala Arg 
245 250 255 

Phe Leu Trp Glu Trp Ala Ser val Arg Phe Ser Trp Leu Ser Leu Leu 
260 265 270 

Val Pro Phe Val Gin Trp Phe val Gly Leu ser Pro Thr val Trp Leu 
275 280 ' 285 

Ser Val lie Trp Met Met Trp Tyr Trp Gly Pro Ser Leu Tyr Asn lie 
290 295 300 

Leu Ser Pro Phe Leu Pro Leu Leu Pro lie Phe Phe Cys Leu Trp Val 
305 310 315 320 

Tyr lie 



<210> 115 
<211> 966 
<212> DNA 

<213> artificial sequence 
<220> 

<223> 32-164 (G145R) + EGF (serotype d) sequence 
<400> 115 

atgggaggtt ggtcttccaa acctcggaaa ggcatgggga cgaatctttc tgttcccaat 60 

cctctgggat tctttcccga tcaccagttg gacggcggcc gcatgaactc tgattccgaa 120 

tgcccgctgt ctcatgacgg ttactgcctg catgatggcg tatgcatgta catcgaagct 180 

ctggacaaat acgcatgcaa ctgtgttgta ggttacatcg gcgaacgttg ccagtatcgc 240 

gacctgaaat ggtgggaact gcgtaaggcg gccgcccctg caccgaacat ggagaacaca 300 

acatcaggat tcctaggacc cctgctcgtg ttacaggcgg ggtttttctt gttgacaaga 360 
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atcctcacaa taccacagag tctagactcg tggtggactt ctctcaattt tctaggggga 420 

gcacccacgt gtcctggcca aaattcgcag tccccaacct ccaatcactc accaacctct 480 

tgtcctccaa tttgtcctgg ctatcgctgg atgtgtctgc ggcgttttat catattcctc 540 

ttcatcctgc tgctatgcct catcttcttg ttggttcttc tggactacca aggtatgttg 600 

cccgtttgtc ctctacttcc aggaacatca accaccagca cggggccatg caagacctgc 660 

acgattcctg ctcaaggaac ctctatgttt ccctcttgtt gctgtacaaa accttcggac 720 

agaaactgca cttgtattcc catcccatca tcctgggctt tcgcaagatt cctatgggag 780 

tgggcctcag tccgtttctc ctggctcagt ttactagtgc catttgttca gtggttcgta 840 

gggctttccc ccactgtttg gctttcagtt atatggatga tgtggtattg ggggccaagt 900 

ctgtacaaca tcttgagtcc ctttttacct ctattaccaa ttttcttttg tctttgggta 960 

tacatt 966 

<210> 116 

<211> 322 

<212> PRT 

<213> artificial sequence 
<220> 

<223> Proteins corresponding to 32-164 (G145R) + EGF (serotype d) 
sequence 

<400> 116 

Met Gly Gly Trp ser Ser Lys Pro Arg Lys Gly Met Gly Thr Asn Leu 
1 5 10 15 

Ser val Pro Asn Pro Leu Gly Phe Phe Pro Asp His Gin Leu Asp Gly 
20 25 30 

Gly Arg Met Asn ser Asp Ser Glu Cys Pro Leu ser His Asp Gly Tyr 
35 40 45 

Cys Leu His Asp Gly val Cys Met Tyr lie Glu Ala Leu Asp Lys Tyr 
50 55 60 

Ala Cys Asn Cys Val Val Gly Tyr He Gly Glu Arg Cys Gin Tyr Arg 
65 * 70 75 80 

Asp Leu Lys Trp Trp Glu Leu Arg Lys Ala Ala Ala Pro Ala Pro Asn 
85 90 95 

Met Glu Asn Thr Thr Ser Gly Phe Leu Gly Pro Leu Leu Val Leu Gin 
100 105 110 
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Ala Gly Phe Phe Leu Leu Thr Arg lie Leu Thr lie Pro Gin Ser Leu 
115 120 125 

Asp Ser Trp Trp Thr Ser Leu Asn Phe Leu Gly Gly Ala Pro Thr Cys 
130 135 140 

Pro Gly Gin Asn Ser Gin Ser Pro Thr Ser Asn His Ser Pro Thr Ser 
145 150 155 160 

Cys Pro Pro lie Cys Pro Gly Tyr Arg Trp Met Cys Leu Arg Arg Phe 
165 170 175 

lie lie Phe Leu Phe lie Leu Leu Leu Cys Leu lie Phe Leu Leu val 
180 185 190 

Leu Leu Asp Tyr Gin Gly Met Leu Pro val Cys Pro Leu Leu Pro Gly 
195 200 205 

Thr ser Thr Thr Ser Thr Gly Pro Cys Lys Thr Cys Thr lie Pro Ala 
210 215 220 

Gin Gly Thr Ser Met Phe Pro Ser Cys Cys Cys Thr Lys Pro Ser Asp 
225 230 235 240 

Arg Asn Cys Thr Cys lie Pro lie Pro Ser Ser Trp Ala Phe Ala Arg 
245 250 255 

Phe Leu Trp Glu Trp Ala Ser val Arg Phe Ser Trp Leu Ser Leu Leu 
260 265 270 

val Pro Phe Val Gin Trp Phe Val Gly Leu Ser Pro Thr val Trp Leu 
275 280 285 

Ser val lie Trp Met Met Trp Tyr Trp Gly Pro Ser Leu Tyr Asn lie 
290 295 300 

Leu Ser Pro Phe Leu Pro Leu Leu Pro lie Phe Phe Cys Leu Trp Val 
305 310 315 320 

Tyr lie 



<210> 117 

<211> 966 

<212> DNA 

<213> artificial sequence 
<220> 

<223> 32-164 (Q129R/G145R) + EGF (serotype d) sequence 
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<400> 117 
atgggaggtt 


ggtcttccaa 


acctcggaaa 


gqcatgggqa 

ZJ Zj Zj ZJ ^3 


cgaatctttc 


tgttcccaat 


60 


cctctgggat 


tctttcccga 


tcaccagttg 


gacggcggcc 


gcatgaactc 


tgattccgaa 


120 


tqcccqctqt 


ctcatgacgg 


ttactgcctg 


catgatggcg 


tatgcatgta 


catcgaagct 


180 


ctggacaaat 


acgcatgcaa 


ctgtgttgta 


ggttacatcg 


gcgaacgttg 

zj zj z/ zj 


ccagtatcgc 


240 


gacctgaaat 


ggtgggaact 


gcgtaaggcg 

zj zj zj zj zj 


gccgcccctg 


caccgaacat 


ggagaacaca 


300 


acatcaggat 


tcctaggacc 


cctgctcgtg 


ttacaggcgg 


ggtttttctt 


gttgacaaga 


360 


atcctcacaa 


taccacagag 


tctagactcg 


tggtggactt 


ctctcaattt 


tctaggggga 


420 


gcacccacgt 


gtcctggcca 


aaattcgcag 


tccccaacct 


ccaatcactc 


accaacctct 


480 


tgtcctccaa 


tttgtcctgg 


ctatcgctgg 


atgtgtctgc 


ggcgttttat 


catattcctc 


540 


ttcatcctgc 


tgctatgcct 

ZJ zj 


catcttcttg 


ttggttcttc 


tggactacca 


aggtatgttg 

ZJ ZJ ZJ ZJ 


600 


cccgtttgtc 


ctctacttcc 


aggaacatca 


accaccagca 


cggggccatg 

ZJ ZJ ZJ ZJ ZJ 


caagacctgc 


660 


acgattcctg 


ctcgaggaac 


ctctatgttt 


ccctcttgtt 


gctgtacaaa 


accttcggac 


720 


agaaactgca 


cttgtattcc 


catcccatca 


tcctgggctt 


tcgcaagatt 


cctatgggag 


780 


tgggcctcag 


tccgtttctc 


ctggctcagt 


ttactagtgc 


catttgttca 


gtggttcgta 


840 


gggctttccc 


ccactgtttg 


gctttcagtt 


atatggatga 


tgtggtattg 


ggggccaagt 


900 


ctgtacaaca 


tcttgagtcc 


ctttttacct 


ctattaccaa 


ttttcttttg 


tctttgggta 


960 


tacatt 
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<210> 118 

<211> 322 

<212> PRT 

<213> artificial sequence 
<220> 

<223> Proteins corresponding to 32-164 (Q129R/G145R) + EGF (serotype d) 
sequence 

<400> 118 

Met Gly Gly Trp Ser Ser Lys Pro Arg Lys Gly Met Gly Thr Asn Leu 
1 ' 5 10 15 

Ser val Pro Asn Pro Leu Gly Phe Phe Pro Asp His Gin Leu Asp Gly 
20 25 30 

Gly Arg Met Asn Ser Asp Ser Glu Cys Pro Leu Ser His Asp Gly Tyr 
35 40 45 

Cys Leu His Asp Gly val Cys Met Tyr lie Glu Ala Leu Asp Lys Tyr 
50 55 60 
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Ala Cys Asn Cys Val Val Gly Tyr lie Gly Glu Arg Cys Gin Tyr Arg 
65 70 75 80 

Asp Leu Lys Trp Trp Glu Leu Arg Lys Ala Ala Ala Pro Ala Pro Asn 
85 90 95 

Met Glu Asn Thr Thr Ser Gly Phe Leu Gly Pro Leu Leu val Leu Gin 
100 105 110 

Ala Gly Phe Phe Leu Leu Thr Arg lie Leu Thr lie Pro Gin Ser Leu 
115 120 125 

Asp Ser Trp Trp Thr ser Leu Asn Phe Leu Gly Gly Ala Pro Thr Cys 
130 135 140 

Pro Gly Gin Asn Ser Gin Ser Pro Thr Ser Asn His Ser Pro Thr ser 
145 150 155 160 

Cys Pro Pro lie Cys Pro Gly Tyr Arg Trp Met Cys Leu Arg Arg Phe 
165 170 175 

lie lie Phe Leu Phe lie Leu Leu Leu Cys Leu lie Phe Leu Leu val 
180 185 190 

Leu Leu Asp Tyr Gin Gly Met Leu Pro val Cys Pro Leu Leu Pro Gly 
195 " 200 205 

Thr Ser Thr Thr ser Thr Gly Pro Cys Lys Thr Cys Thr lie Pro Ala 
210 215 220 

Arg Gly Thr Ser Met Phe Pro Ser Cys Cys Cys Thr Lys Pro Ser Asp 
225 230 235 240 

Arg Asn Cys Thr Cys lie Pro lie Pro Ser Ser Trp Ala Phe Ala Arg 
245 250 255 

Phe Leu Trp Glu Trp Ala Ser Val Arg Phe Ser Trp Leu Ser Leu Leu 
260 265 270 

val Pro Phe val Gin Trp Phe val Gly Leu Ser Pro Thr val Trp Leu 
275 280 ' 285 

Ser val lie Trp Met Met Trp Tyr Trp Gly Pro Ser Leu Tyr Asn lie 
290 295 300 

Leu Ser Pro Phe Leu Pro Leu Leu Pro lie Phe Phe Cys Leu Trp val 
305 310 315 320 
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Tyr lie 

<210> 119 

<211> 825 

<212> DNA 

<213> artificial sequence 



<220> 

<223> 32-164 + STR-tag (streptavidin) (serotype d) sequence 



<400> 119 
atgggaggtt 


ggtcttccaa 


acctcggaaa 


ggcatgggga 


cgaatctttc 


tgttcccaat 


60 


cctctgggat 


tctttcccga 


tcaccagttg 


gacggcggcc 


gctggagcca 


cccgcagttc 


120 


gaaaaagcgg 


ccgcccctgc 


accgaacatg 


gagaacacaa 


catcaggatt 


cctaggaccc 


180 


ctgctcgtgt 


tacaggcggg 


gtttttcttg 


ttgacaagaa 


tcctcacaat 


accacagagt 


240 


ctagactcgt 


ggtggacttc 


tctcaatttt 


ctagggggag 


cacccacgtg 


tcctggccaa 


300 


aattcgcagt 


ccccaacctc 


caatcactca 


ccaacctctt 


gtcctccaat ttgtcctggc 


360 


tatcgctgga 


tgtgtctgcg 


gcgttttatc 


atattcctct 


tcatcctgct 


gctatgcctc 


420 


atcttcttgt 


tggttcttct 


ggactaccaa 


ggtatgttgc 


ccgtttgtcc 


tctacttcca 


480 


ggaacatcaa 


ccaccagcac 


ggggccatgc 


aagacctgca 


cgattcctgc 


tcaaggaacc 


540 


tctatgtttc 


cctcttgttg 


ctgtacaaaa 


ccttcggacg 


gaaactgcac 


ttgtattccc 


600 


atcccatcat 


cctgggcttt 


cgcaagattc 


ctatgggagt 


gggcctcagt 


ccgtttctcc 


660 


tggctcagtt 


tactagtgcc 


atttgttcag 


tggttcgtag 


ggctttcccc 


cactgtttgg 


720 


ctttcagtta 


tatggatgat 


gtggtattgg 


gggccaagtc 


tgtacaacat 


cttgagtccc 


780 


tttttacctc 


tattaccaat 


tttcttttgt 


ctttgggtat 


acatt 




825 



<210> 120 

<211> 275 

<212> PRT 

<213> artificial sequence 
<220> 

<223> Proteins corresponding to 32-164 + STR-tag (streptavidin) 
(serotype d) sequence 

<400> 120 

Met Gly Gly Trp ser Ser Lys Pro Arg Lys Gly Met Gly Thr Asn Leu 
15 10 15 

Ser val Pro Asn Pro Leu Gly Phe Phe Pro Asp His Gin Leu Asp Gly 
20 25 30 
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Gly Arg Trp Ser His Pro Gin Phe Glu Lys Ala Ala Ala Pro Ala Pro 
35 40 45 

Asn Met Glu Asn Thr Thr Ser Gly Phe Leu Gly Pro Leu Leu val Leu 
50 55 60 

Gin Ala Gly Phe Phe Leu Leu Thr Arg lie Leu Thr lie Pro Gin Ser 
65 70 75 80 

Leu Asp ser Trp Trp Thr Ser Leu Asn Phe Leu Gly Gly Ala Pro Thr 
85 90 95 

Cys Pro Gly Gin Asn Ser Gin Ser Pro Thr Ser Asn His Ser Pro Thr 
100 105 110 

Ser Cys Pro Pro lie Cys Pro Gly Tyr Arg Trp Met Cys Leu Arg Arg 
115 120 125 

Phe lie lie Phe Leu Phe lie Leu Leu Leu Cys Leu lie Phe Leu Leu 
130 135 140 

val Leu Leu Asp Tyr Gin Gly Met Leu Pro Val Cys Pro Leu Leu Pro 
145 150 * 155 160 

Gly Thr Ser Thr Thr Ser Thr Gly Pro Cys Lys Thr Cys Thr lie Pro 
165 170 175 

Ala Gin Gly Thr Ser Met Phe Pro Ser Cys Cys Cys Thr Lys Pro Ser 
180 185 190 

Asp Gly Asn Cys Thr Cys lie Pro lie Pro Ser ser Trp Ala Phe Ala 
195 ' 200 205 

Arg Phe Leu Trp Glu Trp Ala Ser val Arg Phe Ser Trp Leu Ser Leu 
210 215 220 

Leu Val Pro Phe Val Gin Trp Phe val Gly Leu Ser Pro Thr val Trp 
225 230 235 240 

Leu Ser val lie Trp Met Met Trp Tyr Trp Gly Pro Ser Leu Tyr Asn 
245 250 255 

lie Leu Ser Pro Phe Leu Pro Leu Leu Pro lie Phe Phe Cys Leu Trp 
260 265 270 

Val Tyr lie 
275 
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<210> 121 
<211> 825 
<212> DNA 

<213> artificial sequence 



<220> 

<223> 32-164 (Q129R) + STR (serotype d) sequence 



<400> 121 
atgggaggtt 


ggtcttccaa 


acctcggaaa 


ggcatgggga 


cgaatctttc 


tgttcccaat 


60 


cctctgggat 


tctttcccga 


tcaccagttg 


gacggcggcc 


gctggagcca 


cccgcagttc 


120 


gaaaaagcgg 


ccgcccctgc 


accgaacatg 


gagaacacaa 


catcaggatt 


cctaggaccc 


180 


ctgctcgtgt 


tacaggcggg 


gtttttcttg 


ttgacaagaa 


tcctcacaat accacagagt 


240 


ctagactcgt 


ggtggacttc 


tctcaatttt 


ctagggggag 


cacccacgtg tcctggccaa 


300 


aattcgcagt 


ccccaacctc 


caatcactca 


ccaacctctt 


gtcctccaat ttgtcctggc 


360 


tatcgctgga 


tgtgtctgcg 


gcgttttatc 


atattcctct 


tcatcctgct 


gctatgcctc 


420 


atcttcttgt 


tggttcttct 


ggactaccaa 


ggtatgttgc 


ccgtttgtcc 


tctacttcca 


480 


ggaacatcaa 


ccaccagcac 


ggggccatgc 


aagacctgca 


cgattcctgc 


tcgaggaacc 


540 


tctatgtttc 


cctcttgttg 


ctgtacaaaa 


ccttcggacg 


gaaactgcac 


ttgtattccc 


600 


atcccatcat 


cctgggcttt 


cgcaagattc 


ctatgggagt 


gggcctcagt 


ccgtttctcc 


660 


tggctcagtt 


tactagtgcc 


atttgttcag 


tggttcgtag 


ggctttcccc 


cactgtttgg 


720 


ctttcagtta 


tatggatgat 


gtggtattgg 


gggccaagtc 


tgtacaacat 


cttgagtccc 


780 


tttttacctc 


tattaccaat 


tttcttttgt 


ctttgggtat 


acatt 




825 



<210> 122 

<211> 275 

<212> PRT 

<213> artificial sequence 
<220> 

<223> Proteins corresponding to 32-164 (Q129R) + STR (serotype d) 
sequence 

<400> 122 

Met Gly Gly Trp ser Ser Lys Pro Arg Lys Gly Met Gly Thr Asn Leu 
1 5 10 15 



Ser val Pro Asn Pro Leu Gly Phe Phe Pro Asp His Gin Leu Asp Gly 
20 25 30 



Gly Arg Trp Ser His Pro Gin Phe Glu Lys Ala Ala Ala Pro Ala Pro 
35 40 45 



Asn Met Glu Asn Thr Thr Ser Gly Phe Leu Gly Pro Leu Leu val Leu 
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50 55 60 

Gin Ala Gly Phe Phe Leu Leu Thr Arg lie Leu Thr lie Pro Gin ser 
65 70 75 80 

Leu Asp ser Trp Trp Thr Ser Leu Asn Phe Leu Gly Gly Ala Pro Thr 
85 90 95 

Cys Pro Gly Gin Asn Ser Gin Ser Pro Thr Ser Asn His Ser Pro Thr 
100 105 110 

Ser cys Pro Pro lie Cys Pro Gly Tyr Arg Trp Met Cys Leu Arg Arg 
115 120 125 

Phe He lie Phe Leu Phe lie Leu Leu Leu Cys Leu lie Phe Leu Leu 
130 135 140 

Val Leu Leu Asp Tyr Gin Gly Met Leu Pro Val Cys Pro Leu Leu Pro 
145 150 155 160 

Gly Thr Ser Thr Thr Ser Thr Gly Pro Cys Lys Thr Cys Thr lie Pro 
165 170 175 

Ala Arg Gly Thr ser Met Phe Pro ser Cys Cys Cys Thr Lys Pro Ser 
180 185 190 

Asp Gly Asn Cys Thr cys lie Pro lie Pro Ser Ser Trp Ala Phe Ala 
195 200 205 

Arg Phe Leu Trp Glu Trp Ala Ser val Arg Phe Ser Trp Leu Ser Leu 
210 215 " 220 

Leu val Pro Phe Val Gin Trp Phe val Gly Leu Ser Pro Thr val Trp 
225 230 235 240 

Leu Ser Val lie Trp Met Met Trp Tyr Trp Gly Pro Ser Leu Tyr Asn 
245 250 255 

lie Leu Ser Pro Phe Leu Pro Leu Leu Pro lie Phe Phe Cys Leu Trp 
260 265 270 

Val Tyr lie 
275 

<210> 123 
<211> 825 
<212> DNA 

<213> artificial sequence 
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<220> 

<223> 32-164 (G145R) + STR (serotype d) sequence 



<400> 123 
atgggaggtt 


ggtcttccaa 


acctcggaaa 


ggcatgggga 


cgaatctttc 


tgttcccaat 


60 


cctctgggat 


tctttcccga 


tcaccagttg 


gacggcggcc 


gctggagcca 


cccgcagttc 


120 


gaaaaagcgg 


ccgcccctgc 


accgaacatg 


gagaacacaa 


catcaggatt 


cctaggaccc 


180 


ctgctcgtgt 


tacaggcggg 


gtttttcttg 


ttgacaagaa 


tcctcacaat 


accacagagt 


240 


ctagactcgt 


ggtggacttc 


tctcaatttt 


ctagggggag 


cacccacgtg 


tcctggccaa 


300 


aattcgcagt 


ccccaacctc 


caatcactca 


ccaacctctt 


gtcctccaat 


ttgtcctggc 


360 


tatcgctgga 


tgtgtctgcg 


gcgttttatc 


atattcctct 


tcatcctgct 


gctatgcctc 


420 


atcttcttgt 


tggttcttct 


ggactaccaa 


ggtatgttgc 


ccgtttgtcc 


tctacttcca 


480 


ggaacatcaa 


ccaccagcac 


ggggccatgc 


aagacctgca 


cgattcctgc tcaaggaacc 


540 


tctatgtttc 


cctcttgttg 


ctgtacaaaa 


ccttcggaca 


gaaactgcac 


ttgtattccc 


600 


atcccatcat 


cctgggcttt 


cgcaagattc 


ctatgggagt 


gggcctcagt 


ccgtttctcc 


660 


tggctcagtt 


tactagtgcc 


atttgttcag 


tggttcgtag 


ggctttcccc 


cactgtttgg 


720 


ctttcagtta 


tatggatgat 


gtggtattgg 


gggccaagtc 


tgtacaacat 


cttgagtccc 


780 


tttttacctc 


tattaccaat 


tttcttttgt 


ctttgggtat 


acatt 




825 



<210> 124 

<211> 275 

<212> PRT 

<213> artificial sequence 
<220> 

<223> Proteins corresponding to 32-164 (G145R) + STR (serotype d) 
sequence 

<400> 124 

Met Gly Gly Trp ser Ser Lys Pro Arg Lys Gly Met Gly Thr Asn Leu 
1 5 10 15 

Ser Val Pro Asn Pro Leu Gly Phe Phe Pro Asp His Gin Leu Asp Gly 
20 25 30 

Gly Arg Trp ser His Pro Gin Phe Glu Lys Ala Ala Ala Pro Ala Pro 
35 40 45 

Asn Met Glu Asn Thr Thr Ser Gly Phe Leu Gly Pro Leu Leu val Leu 
50 55 60 



Gin Ala Gly Phe Phe Leu Leu Thr Arg lie Leu Thr lie Pro Gin Ser 
65 70 75 80 
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Leu Asp Ser Trp Trp Thr Ser Leu Asn Phe Leu Gly Gly Ala Pro Thr 
85 90 95 

Cys Pro Gly Gin Asn ser Gin Ser Pro Thr Ser Asn His Ser Pro Thr 
100 105 110 

Ser Cys Pro Pro lie Cys Pro Gly Tyr Arg Trp Met Cys Leu Arg Arg 
115 120 125 

Phe lie lie Phe Leu Phe lie Leu Leu Leu cys Leu lie Phe Leu Leu 
130 135 140 

Val Leu Leu Asp Tyr Gin Gly Met Leu Pro val Cys Pro Leu Leu Pro 
145 150 * 155 160 

Gly Thr Ser Thr Thr Ser Thr Gly Pro Cys Lys Thr Cys Thr lie Pro 
165 170 175 

Ala Gin Gly Thr Ser Met Phe Pro Ser Cys Cys Cys Thr Lys Pro Ser 
180 185 190 

Asp Arg Asn Cys Thr Cys lie Pro lie Pro Ser Ser Trp Ala Phe Ala 
195 200 205 

Arg Phe Leu Trp Glu Trp Ala Ser Val Arg Phe Ser Trp Leu Ser Leu 
210 215 220 

Leu Val Pro Phe Val Gin Trp Phe Val Gly Leu Ser Pro Thr val Trp 
225 230 235 240 

Leu Ser Val lie Trp Met Met Trp Tyr Trp Gly Pro Ser Leu Tyr Asn 
245 250 255 

lie Leu Ser Pro Phe Leu Pro Leu Leu Pro lie Phe Phe Cys Leu Trp 
260 265 270 

val Tyr lie 
275 

<210> 125 

<211> 825 

<212> DNA 

<213> artificial sequence 
<220> 

<223> 32-164 (Ql29R/Gl45R)/STR-tag (serotype d) sequence 

<400> 125 
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ggtcttccaa 


sequence listing_ST25.txt 
acctcggaaa ggcatgggga cgaatctttc 


tgttcccaat 


60 


cctctgggat 


tctttcccga 


tcaccagttg 


gacggcggcc 


gctggagcca 


cccgcagttc 


120 


gaaaaagcgg 


ccgcccctgc 


accgaacatg 


gagaacacaa 


catcaggatt 


cctaggaccc 


180 


ctgctcgtgt 


tacaggcggg 


gtttttcttg 


ttgacaagaa 


tcctcacaat 


accacagagt 


240 


ctagactcgt 


ggtggacttc 


tctcaatttt 


ctagggggag 


cacccacgtg 


tcctggccaa 


300 


aattcgcagt 


ccccaacctc 


caatcactca 


ccaacctctt 


gtcctccaat ttgtcctggc 


360 


tatcgctgga 


tgtgtctgcg 


gcgttttatc 


atattcctct 


tcatcctgct 


gctatgcctc 


420 


atcttcttgt 


tggttcttct 


ggactaccaa 


ggtatgttgc 


ccgtttgtcc 


tctacttcca 


480 


ggaacatcaa 


ccaccagcac 


ggggccatgc 


aagacctgca 


cgattcctgc 


tcgaggaacc 


540 


tctatgtttc 


cctcttgttg 


ctgtacaaaa 


ccttcggaca 


gaaactgcac ttgtattccc 


600 


dLLLLdLLaL 


rrtnnnrttt 


cgcaagattc 


ctatgggagt 


gggcctcagt 


ccgtttctcc 


UUU 


tggctcagtt 


tactagtgcc 


atttgttcag 


tggttcgtag 


ggctttcccc 


cactgtttgg 


720 


ctttcagtta 


tatggatgat 


gtggtattgg 


gggccaagtc 


tgtacaacat 


cttgagtccc 


780 


tttttacctc 


tattaccaat 


tttcttttgt 


ctttgggtat 


acatt 




825 



<210> 126 

<211> 275 

<212> PRT 

<213> artificial sequence 
<220> 

<223> Proteins corresponding to 32-164 (Ql29R/Gl45R)/STR-tag (serotype 
d) sequence 

<400> 126 

Met Gly Gly Trp Ser ser Lys Pro Arg Lys Gly Met Gly Thr Asn Leu 
1 5 10 15 

Ser val Pro Asn Pro Leu Gly Phe Phe Pro Asp His Gin Leu Asp Gly 
20 25 30 

Gly Arg Trp Ser His Pro Gin Phe Glu Lys Ala Ala Ala Pro Ala Pro 
35 40 45 

Asn Met Glu Asn Thr Thr Ser Gly Phe Leu Gly Pro Leu Leu val Leu 
50 55 60 

Gin Ala Gly Phe Phe Leu Leu Thr Arg lie Leu Thr lie Pro Gin Ser 
65 70 75 80 

Leu Asp Ser Trp Trp Thr Ser Leu Asn Phe Leu Gly Gly Ala Pro Thr 
85 90 95 
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Cys Pro Gly Gin Asn Ser Gin Ser Pro Thr Ser Asn His Ser Pro Thr 
100 105 110 

Ser Cys Pro Pro lie Cys Pro Gly Tyr Arg Trp Met Cys Leu Arg Arg 
115 120 125 

Phe lie lie Phe Leu Phe lie Leu Leu Leu Cys Leu lie Phe Leu Leu 
130 135 140 

Val Leu Leu Asp Tyr Gin Gly Met Leu Pro Val Cys Pro Leu Leu Pro 
145 150 155 160 

Gly Thr Ser Thr Thr Ser Thr Gly Pro Cys Lys Thr cys Thr lie Pro 
165 170 175 

Ala Arg Gly Thr ser Met Phe Pro Ser Cys Cys Cys Thr Lys Pro Ser 
180 185 190 

Asp Arg Asn Cys Thr Cys lie Pro lie Pro Ser Ser Trp Ala Phe Ala 
195 200 205 

Arg Phe Leu Trp Glu Trp Ala Ser Val Arg Phe Ser Trp Leu Ser Leu 
210 215 ~ 220 

Leu val Pro Phe val Gin Trp Phe val Gly Leu Ser Pro Thr Val Trp 
225 230 235 240 

Leu Ser Val lie Trp Met Met Trp Tyr Trp Gly Pro Ser Leu Tyr Asn 
245 250 255 

lie Leu Ser Pro Phe Leu Pro Leu Leu Pro lie Phe Phe Cys Leu Trp 
260 265 270 

val Tyr lie 
275 

<210> 127 
<211> 1203 
<212> DNA 

<213> artificial sequence 
<220> 

<223> 44-164 + 22 (serotype d) sequence 
<400> 127 

atgggaggtt ggtcttccaa acctcggaaa ggcatgggga cgaatctttc tgttcccaat 60 

cctctgggat tctttcccga tcaccagttg gaccctgcgt tcggagccaa ctcaaacaat 120 

ccagattggg gcggccgcgc gcaacacgat gaagccgtag acaacaaatt caacaaagaa 180 
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caacaaaacg 


cgttctatga 


gatcttacat 


ttacctaact 


taaacgaaga 


acaacgaaac 


240 


gccttcatcc 


aaagtttaaa 


agatgaccca 


agccaaagcg 


ctaacctttt 


agcagaagct 


300 


aaaaagctaa 


atgatgctca 


ggcgccgaaa 


gtagacaaca 


aattcaacaa 


agaacaacaa 


360 


aacgcgttct 


atgagatctt 


acatttacct 


aacttaaacg 


aagaacaacg 


aaacgccttc 


420 


atccaaagtt 


taaaagatga 


cccaagccaa 


agcgctaacc 


ttttagcaga 


agctaaaaag 


480 


ctaaatgatg 


ctcaggcgcc 


gaaagcggcc 


gcccctgcac 


cgaacatgga 


gaacacaaca 


540 


tcaggattcc 


taggacccct 


gctcgtgtta 


caggcggggt 


ttttcttgtt 


gacaagaatc 


600 


ctcacaatac 


cacagagtct 


agactcgtgg 


tggacttctc 


tcaattttct 


agggggagca 


660 


cccacgtgtc 


ctggccaaaa 


ttcgcagtcc 


ccaacctcca 


atcactcacc 


aacctcttgt 


720 


cctccaattt 


gtcctggcta 


tcgctggatg 


tgtctgcggc 


gttttatcat 


attcctcttc 


780 


atcctgctgc 


tatgcctcat 


cttcttgttg 


gttcttctgg 


actaccaagg 


tatgttgccc 


840 


gtttgtcctc 


tacttccagg 


aacatcaacc 


accagcacgg 


ggccatgcaa 


gacctgcacg 


900 


attcctgctc 


aaggaacctc 


tatgtttccc 


tcttgttgct 


gtacaaaacc 


ttcggacgga 


960 


aactgcactt 


gtattcccat 


cccatcatcc 


taaactttca 


caaaattcct 


atgggagtgg 


1020 


gcctcagtcc 


gtttctcctg 


gctcagttta 


ctagtgccat 


ttgttcagtg 


gttcgtaggg 


1 ADA 


ctttccccca 


ctgtttggct 


ttcagttata 


tggatgatgt 


ggtattgggg 


gccaagtctg 


1140 


tacaacatct 


tgagtccctt 


tttacctcta 


ttaccaattt 


tcttttgtct 


ttgggtatac 


1200 


att 












1203 



<210> 128 

<211> 401 

<212> PRT 

<213> artificial sequence 
<220> 

<223> Proteins corresponding to 44-164 + ZZ (serotype d) sequence 

<400> 128 

Met Gly Gly Trp Ser Ser Lys Pro Arg Lys Gly Met Gly Thr Asn Leu 
1 5 10 15 

Ser Val Pro Asn Pro Leu Gly Phe Phe Pro Asp His Gin Leu Asp Pro 
20 25 30 

Ala Phe Gly Ala Asn Ser Asn Asn Pro Asp Trp Gly Gly Arg Ala Gin 
35 40 45 

His Asp Glu Ala val Asp Asn Lys Phe Asn Lys Glu Gin Gin Asn Ala 
50 55 60 
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Phe Tyr Glu lie Leu His Leu Pro Asn Leu Asn Glu Glu Gin Arg Asn 
65 70 75 80 

Ala Phe lie Gin Ser Leu Lys Asp Asp Pro Ser Gin Ser Ala Asn Leu 
85 90 95 

Leu Ala Glu Ala Lys Lys Leu Asn Asp Ala Gin Ala Pro Lys val Asp 
100 105 110 

Asn Lys Phe Asn Lys Glu Gin Gin Asn Ala Phe Tyr Glu He Leu His 
115 120 125 

Leu Pro Asn Leu Asn Glu Glu Gin Arg Asn Ala Phe lie Gin Ser Leu 
130 135 ~ 140 

Lys Asp Asp Pro Ser Gin Ser Ala Asn Leu Leu Ala Glu Ala Lys Lys 
145 150 155 160 

Leu Asn Asp Ala Gin Ala Pro Lys Ala Ala Ala Pro Ala Pro Asn Met 
165 170 175 

Glu Asn Thr Thr ser Gly Phe Leu Gly Pro Leu Leu val Leu Gin Ala 
180 185 190 

Gly Phe Phe Leu Leu Thr Arg lie Leu Thr lie Pro Gin Ser Leu Asp 
195 200 205 

Ser Trp Trp Thr Ser Leu Asn Phe Leu Gly Gly Ala Pro Thr Cys Pro 
210 215 220 

Gly Gin Asn Ser Gin Ser Pro Thr ser Asn His Ser Pro Thr Ser Cys 
225 230 235 240 

Pro Pro He Cys Pro Gly Tyr Arg Trp Met Cys Leu Arg Arg Phe lie 
245 250 255 

lie Phe Leu Phe lie Leu Leu Leu Cys Leu lie Phe Leu Leu val Leu 
260 265 270 

Leu Asp Tyr Gin Gly Met Leu Pro val Cys Pro Leu Leu Pro Gly Thr 
275 280 ' 285 

Ser Thr Thr Ser Thr Gly Pro Cys Lys Thr cys Thr lie Pro Ala Gin 
290 295 300 

Gly Thr Ser Met Phe Pro Ser Cys Cys Cys Thr Lys Pro Ser Asp Gly 
305 310 315 320 
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Asn Cys Thr Cys lie Pro lie Pro Ser Ser Trp Ala Phe Ala Arg Phe 
325 330 335 

Leu Trp Glu Trp Ala Ser val Arg Phe Ser Trp Leu Ser Leu Leu val 
340 345 350 

Pro Phe Val Gin Trp Phe Val Gly Leu Ser Pro Thr val Trp Leu Ser 
355 360 365 

val lie Trp Met Met Trp Tyr Trp Gly Pro Ser Leu Tyr Asn lie Leu 
370 375 380 

Ser Pro Phe Leu Pro Leu Leu Pro lie Phe Phe Cys Leu Trp val Tyr 
385 390 395 400 

He 



<210> 129 

<211> 1203 

<212> DNA 

<213> artificial sequence 
<220> 

<223> 44-164 (Q129R) + ZZ (serotype d) sequence 

<400> 129 



atgggaggtt 


ggtcttccaa 


acctcggaaa 


ggcatgggga 


cgaatctttc 


tgttcccaat 


60 


cctctgggat 


tctttcccga 


tcaccagttg 


gaccctgcgt 


tcggagccaa 


ctcaaacaat 


120 


ccagattggg 


gcggccgcgc 


gcaacacgat 


gaagccgtag 


acaacaaatt 


caacaaagaa 


180 


caacaaaacg 


cgttctatga 


gatcttacat 


ttacctaact 


taaacgaaga 


acaacgaaac 


240 


gccttcatcc 


aaagtttaaa 


agatgaccca 


agccaaagcg 


ctaacctttt 


agcagaagct 


300 


aaaaagctaa 


atgatgctca 


ggcgccgaaa 


gtagacaaca 


aattcaacaa 


agaacaacaa 


360 


aacgcgttct 


atgagatctt 


acatttacct 


aacttaaacg 


aagaacaacg 


aaacgccttc 


420 


atccaaagtt 


taaaagatga 


cccaagccaa 


agcgctaacc 


ttttagcaga 


agctaaaaag 


480 


ctaaatgatg 


ctcaggcgcc 


gaaagcggcc 


gcccctgcac 


cgaacatgga 


gaacacaaca 


540 


tcaggattcc 


taggacccct 


gctcgtgtta 


caggcggggt 


ttttcttgtt 


gacaagaatc 


600 


ctcacaatac 


cacagagtct 


agactcgtgg 


tggacttctc 


tcaattttct 


agggggagca 


660 


cccacgtgtc 


ctggccaaaa 


ttcgcagtcc 


ccaacctcca 


atcactcacc 


aacctcttgt 


720 


cctccaattt 


gtcctggcta 


tcgctggatg 


tgtctgcggc 


gttttatcat 


attcctcttc 


780 


atcctgctgc 


tatgcctcat 


cttcttgttg 


gttcttctgg 


actaccaagg 


tatgttgccc 


840 
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gtttgtcctc tacttccagg aacatcaacc accagcacgg ggccatgcaa gacctgcacg 900 

attcctgctc gaggaacctc tatgtttccc tcttgttgct gtacaaaacc ttcggacgga 960 

aactgcactt gtattcccat cccatcatcc tgggctttcg caagattcct atgggagtgg 1020 

gcctcagtcc gtttctcctg gctcagttta ctagtgccat ttgttcagtg gttcgtaggg 1080 

ctttccccca ctgtttggct ttcagttata tggatgatgt ggtattgggg gccaagtctg 1140 

tacaacatct tgagtccctt tttacctcta ttaccaattt tcttttgtct ttgggtatac 1200 

att 1203 

<210> 130 

<211> 401 

<212> PRT 

<213> artificial sequence 
<220> 

<223> Proteins corresponding to 44-164 (Q129R) + ZZ (serotype d) 
sequence 

<400> 130 

Met Gly Gly Trp Ser Ser Lys Pro Arg Lys Gly Met Gly Thr Asn Leu 
15 10 15 

Ser Val Pro Asn Pro Leu Gly Phe Phe Pro Asp His Gin Leu Asp Pro 
20 25 30 

Ala Phe Gly Ala Asn Ser Asn Asn Pro Asp Trp Gly Gly Arg Ala Gin 
35 40 45 

His Asp Glu Ala val Asp Asn Lys Phe Asn Lys Glu Gin Gin Asn Ala 
50 55 60 

Phe Tyr Glu lie Leu His Leu Pro Asn Leu Asn Glu Glu Gin Arg Asn 
65 70 75 80 

Ala Phe lie Gin Ser Leu Lys Asp Asp Pro Ser Gin ser Ala Asn Leu 
85 90 95 

Leu Ala Glu Ala Lys Lys Leu Asn Asp Ala Gin Ala Pro Lys val Asp 
100 105 110 

Asn Lys Phe Asn Lys Glu Gin Gin Asn Ala Phe Tyr Glu lie Leu His 
115 120 125 

Leu Pro Asn Leu Asn Glu Glu Gin Arg Asn Ala Phe lie Gin Ser Leu 
130 135 ^ 140 



Lys Asp Asp Pro Ser Gin Ser Ala Asn Leu Leu Ala Glu Ala Lys Lys 
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145 150 155 160 

Leu Asn Asp Ala Gin Ala Pro Lys Ala Ala Ala Pro Ala Pro Asn Met 
165 170 175 

Glu Asn Thr Thr Ser Gly Phe Leu Gly Pro Leu Leu Val Leu Gin Ala 
180 185 190 

Gly Phe Phe Leu Leu Thr Arg lie Leu Thr lie Pro Gin Ser Leu Asp 
195 200 205 

Ser Trp Trp Thr ser Leu Asn Phe Leu Gly Gly Ala Pro Thr cys Pro 
210 215 220 

Gly Gin Asn ser Gin Ser Pro Thr Ser Asn His Ser Pro Thr Ser Cys 
225 230 235 240 

Pro Pro lie Cys Pro Gly Tyr Arg Trp Met Cys Leu Arg Arg Phe lie 
245 ' 250 " 255 

lie Phe Leu Phe lie Leu Leu Leu Cys Leu lie Phe Leu Leu val Leu 
260 265 270 

Leu Asp Tyr Gin Gly Met Leu Pro Val Cys Pro Leu Leu Pro Gly Thr 
275 280 285 

ser Thr Thr Ser Thr Gly Pro Cys Lys Thr Cys Thr lie Pro Ala Arg 
290 295 300 

Gly Thr Ser Met Phe Pro Ser Cys Cys Cys Thr Lys Pro Ser Asp Gly 
305 310 315 320 

Asn Cys Thr Cys lie Pro lie Pro Ser Ser Trp Ala Phe Ala Arg Phe 
325 330 335 

Leu Trp Glu Trp Ala Ser val Arg Phe Ser Trp Leu Ser Leu Leu val 
340 345 350 

Pro Phe val Gin Trp Phe val Gly Leu Ser Pro Thr val Trp Leu Ser 
355 360 365 

val lie Trp Met Met Trp Tyr Trp Gly Pro Ser Leu Tyr Asn lie Leu 
370 375 380 

Ser pro Phe Leu Pro Leu Leu Pro lie Phe Phe Cys Leu Trp Val Tyr 
385 390 395 400 
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He 



<210> 131 

<211> 1203 

<212> DNA 

<213> artificial sequence 
<220> 

<223> 44-164 (G145R) + ZZ (serotype d) sequence 

<400> 131 



atgggaggtt 


ggtcttccaa 


acctcggaaa 


ggcatgggga 


cgaatctttc 


tgttcccaat 


60 


cctctgggat 


tctttcccga 


tcaccagttg 


gaccctgcgt 


tcggagccaa 


ctcaaacaat 


120 


ccagattggg 


gcggccgcgc 


gcaacacgat 


gaagccgtag 


acaacaaatt 


caacaaagaa 


180 


caacaaaacg 


cgttctatga 


gatcttacat 


ttacctaact 


taaacgaaga 


acaacgaaac 


240 


gccttcatcc 


aaagtttaaa 


agatgaccca 


agccaaagcg 


ctaacctttt 


agcagaagct 


300 


aaaaagctaa 


atgatgctca 


ggcgccgaaa 


gtagacaaca 


aattcaacaa 


agaacaacaa 


360 


aacgcgttct 


atgagatctt 


acatttacct 


aacttaaacg 


aagaacaacg 


aaacgccttc 


420 


atccaaagtt 


taaaagatga 


cccaagccaa 


agcgctaacc 


ttttagcaga 


agctaaaaag 


480 


ctaaatgatg 


ctcaggcgcc 


gaaagcggcc 


gcccctgcac 


cgaacatgga 


gaacacaaca 


540 


tcaggattcc 


taggacccct 


gctcgtgtta 


caqqcqqqqt 


ttttcttgtt 


gacaagaatc 


600 


ctcacaatac 


cacagagtct 


agactcgtgg 


tggacttctc 


tcaattttct 


agggggagca 


660 


cccacgtgtc 


ctggccaaaa 


ttcgcagtcc 


ccaacctcca 


atcactcacc 


aacctcttgt 


720 


cctccaattt 


gtcctggcta 


tcgctggatg 


tgtctgcggc 


gttttatcat 


attcctcttc 


780 


atcctgctgc 


tatgcctcat 


cttcttgttg 


gttcttctgg 


actaccaagg 


tatgttgccc 


840 


gtttgtcctc 


tacttccagg 


aacatcaacc 


accagcacgg 


ggccatgcaa 


gacctgcacg 


900 


attcctgctc 


aaggaacctc 


tatgtttccc 


tcttgttgct 


gtacaaaacc 


ttcggacaga 


960 


aactgcactt 


gtattcccat 


cccatcatcc 


tgggctttcg 


caagattcct 


atgggagtgg 


1020 


gcctcagtcc 


gtttctcctg 


gctcagttta 


ctagtgccat 


ttgttcagtg 


gttcgtaggg 


1080 


ctttccccca 


ctgtttggct 


ttcagttata 


tggatgatgt 


ggtattgggg 


gccaagtctg 


1140 


tacaacatct 


tgagtccctt 


tttacctcta 


ttaccaattt 


tcttttgtct 


ttgggtatac 


1200 


att 
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<210> 132 

<211> 401 

<212> PRT 

<213> artificial sequence 
<220> 

<223> Proteins corresponding to 44-164 (G145R) + ZZ (serotype d) 
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sequence 
<400> 132 

Met Gly Gly Trp Ser Ser Lys Pro Arg Lys Gly Met Gly Thr Asn Leu 
15 10 " 15 

Ser val Pro Asn Pro Leu Gly Phe Phe Pro Asp His Gin Leu Asp Pro 
20 25 30 

Ala Phe Gly Ala Asn Ser Asn Asn Pro Asp Trp Gly Gly Arg Ala Gin 
35 40 45 

His Asp Glu Ala Val Asp Asn Lys Phe Asn Lys Glu Gin Gin Asn Ala 
50 55 60 

Phe Tyr Glu lie Leu His Leu pro Asn Leu Asn Glu Glu Gin Arg Asn 
65 70 75 80 

Ala Phe He Gin Ser Leu Lys Asp Asp Pro Ser Gin Ser Ala Asn Leu 
85 90 95 

Leu Ala Glu Ala Lys Lys Leu Asn Asp Ala Gin Ala Pro Lys Val Asp 
100 105 110 

Asn Lys Phe Asn Lys Glu Gin Gin Asn Ala Phe Tyr Glu lie Leu His 
115 120 125 

Leu Pro Asn Leu Asn Glu Glu Gin Arg Asn Ala Phe lie Gin Ser Leu 
130 135 " 140 

Lys Asp Asp Pro Ser Gin Ser Ala Asn Leu Leu Ala Glu Ala Lys Lys 
145 150 155 160 

Leu Asn Asp Ala Gin Ala Pro Lys Ala Ala Ala Pro Ala Pro Asn Met 
165 170 175 

Glu Asn Thr Thr Ser Gly Phe Leu Gly Pro Leu Leu Val Leu Gin Ala 
180 185 190 

Gly Phe Phe Leu Leu Thr Arg lie Leu Thr lie Pro Gin Ser Leu Asp 
195 200 205 

Ser Trp Trp Thr Ser Leu Asn Phe Leu Gly Gly Ala Pro Thr Cys Pro 
210 215 220 

Gly Gin Asn Ser Gin Ser Pro Thr Ser Asn His Ser Pro Thr Ser Cys 
225 230 235 240 
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pro Pro lie Cys Pro 
245 



Gly 



Tyr 



Arg Trp Met Cys 
250 



Leu 



Arg 



Arg Phe lie 
255 



lie Phe Leu Phe lie 
260 



Leu 



Leu 



Leu Cys Leu lie 
265 



Phe 



Leu 



Leu val Leu 
270 



Leu Asp Tyr Gin Gly Met Leu Pro Val Cys Pro Leu Leu Pro Gly Thr 
275 280 285 



Ser Thr Thr Ser Thr Gly Pro Cys Lys Thr Cys Thr lie Pro Ala Gin 
290 295 300 



Gly Thr Ser Met Phe Pro Ser Cys Cys Cys Thr Lys Pro ser Asp Arg 
305 310 315 320 



Asn Cys Thr Cys lie Pro lie Pro Ser ser Trp Ala Phe Ala Arg Phe 
325 330 335 



Leu Trp Glu Trp Ala Ser val Arg Phe Ser Trp Leu Ser Leu Leu Val 
340 345 350 



Pro Phe Val Gin Trp Phe Val Gly Leu ser Pro Thr val Trp Leu ser 
355 360 365 



val lie Trp Met Met Trp Tyr Trp Gly Pro Ser Leu Tyr Asn lie Leu 
370 375 380 



Ser pro Phe Leu Pro Leu Leu Pro lie Phe Phe Cys Leu Trp val Tyr 
385 390 395 400 



He 



<210> 133 
<211> 1203 
<212> DNA 

<213> artificial sequence 
<220> 

<223> 44-164 (Q129R/G145R) + zz (serotype d) sequence 
<400> 133 

atgggaggtt ggtcttccaa acctcggaaa ggcatgggga cgaatctttc tgttcccaat 60 

cctctgggat tctttcccga tcaccagttg gaccctgcgt tcggagccaa ctcaaacaat 120 

ccagattggg gcggccgcgc gcaacacgat gaagccgtag acaacaaatt caacaaagaa 180 

caacaaaacg cgttctatga gatcttacat ttacctaact taaacgaaga acaacgaaac 240 

gccttcatcc aaagtttaaa agatgaccca agccaaagcg ctaacctttt agcagaagct 300 

Page 121 



sequence listing_ST25.txt 

aaaaagctaa atgatgctca ggcgccgaaa gtagacaaca aattcaacaa agaacaacaa 360 

aacgcgttct atgagatctt acatttacct aacttaaacg aagaacaacg aaacgccttc 420 

atccaaagtt taaaagatga cccaagccaa agcgctaacc ttttagcaga agctaaaaag 480 

ctaaatgatg ctcaggcgcc gaaagcggcc gcccctgcac cgaacatgga gaacacaaca 540 

tcaggattcc taggacccct gctcgtgtta caggcggggt ttttcttgtt gacaagaatc 600 

ctcacaatac cacagagtct agactcgtgg tggacttctc tcaattttct agggggagca 660 

cccacgtgtc ctggccaaaa ttcgcagtcc ccaacctcca atcactcacc aacctcttgt 720 

cctccaattt gtcctggcta tcgctggatg tgtctgcggc gttttatcat attcctcttc 780 

atcctgctgc tatgcctcat cttcttgttg gttcttctgg actaccaagg tatgttgccc 840 

gtttgtcctc tacttccagg aacatcaacc accagcacgg ggccatgcaa gacctgcacg 900 

attcctgctc gaggaacctc tatgtttccc tcttgttgct gtacaaaacc ttcggacaga 960 

aactgcactt gtattcccat cccatcatcc tgggctttcg caagattcct atgggagtgg 1020 

gcctcagtcc gtttctcctg gctcagttta ctagtgccat ttgttcagtg gttcgtaggg 1080 

ctttccccca ctgtttggct ttcagttata tggatgatgt ggtattgggg gccaagtctg 1140 

tacaacatct tgagtccctt tttacctcta ttaccaattt tcttttgtct ttgggtatac 1200 

att 1203 

<210> 134 

<211> 401 

<212> PRT 

<213> artificial sequence 
<220> 

<223> Proteins corresponding to 44-164 (Q129R/G145R) + ZZ (serotype d) 
sequence 

<400> 134 

Met Gly Gly Trp ser Ser Lys Pro Arg Lys Gly Met Gly Thr Asn Leu 



Ala Phe Gly Ala Asn Ser Asn Asn Pro Asp Trp Gly Gly Arg Ala Gin 

35 40 45 

His Asp Glu Ala val Asp Asn Lys Phe Asn Lys Glu Gin Gin Asn Ala 

50 55 60 

Phe Tyr Glu lie Leu His Leu Pro Asn Leu Asn Glu Glu Gin Arg Asn 
65 70 75 80 
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Ala Phe lie Gin Ser Leu Lys Asp Asp Pro Ser Gin Ser Ala Asn Leu 
85 90 95 

Leu Ala Glu Ala Lys Lys Leu Asn Asp Ala Gin Ala Pro Lys val Asp 
100 105 110 

Asn Lys Phe Asn Lys Glu Gin Gin Asn Ala Phe Tyr Glu lie Leu His 
115 120 125 

Leu Pro Asn Leu Asn Glu Glu Gin Arg Asn Ala Phe lie Gin Ser Leu 
130 135 ~ 140 

Lys Asp Asp Pro Ser Gin Ser Ala Asn Leu Leu Ala Glu Ala Lys Lys 
145 150 155 160 

Leu Asn Asp Ala Gin Ala Pro Lys Ala Ala Ala Pro Ala Pro Asn Met 
165 170 175 

Glu Asn Thr Thr Ser Gly Phe Leu Gly Pro Leu Leu val Leu Gin Ala 
180 185 190 

Gly Phe Phe Leu Leu Thr Arg lie Leu Thr lie Pro Gin Ser Leu Asp 
195 200 205 

Ser Trp Trp Thr Ser Leu Asn Phe Leu Gly Gly Ala Pro Thr Cys Pro 
210 215 220 

Gly Gin Asn Ser Gin Ser Pro Thr Ser Asn His Ser Pro Thr Ser Cys 
225 230 235 240 

Pro Pro lie Cys Pro Gly Tyr Arg Trp Met Cys Leu Arg Arg Phe lie 
245 250 255 

lie Phe Leu Phe lie Leu Leu Leu Cys Leu lie Phe Leu Leu Val Leu 
260 265 270 

Leu Asp Tyr Gin Gly Met Leu Pro Val Cys Pro Leu Leu Pro Gly Thr 
275 280 285 

Ser Thr Thr Ser Thr Gly Pro Cys Lys Thr cys Thr lie Pro Ala Arg 
290 295 300 

Gly Thr Ser Met Phe Pro Ser Cys Cys Cys Thr Lys Pro Ser Asp Arg 
305 310 315 320 

Asn Cys Thr Cys lie Pro lie Pro Ser Ser Trp Ala Phe Ala Arg Phe 
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325 330 " 335 

Leu Trp Glu Trp Ala Ser Val Arg Phe Ser Trp Leu Ser Leu Leu val 
340 345 350 

pro Phe val Gin Trp Phe val Gly Leu ser Pro Thr val Trp Leu Ser 
355 360 365 

Val He Trp Met Met Trp Tyr Trp Gly Pro Ser Leu Tyr Asn lie Leu 
370 375 380 

Ser Pro Phe Leu Pro Leu Leu Pro lie Phe Phe Cys Leu Trp val Tyr 
385 390 395 400 

He 



<210> 135 

<211> 1002 

<212> DNA 

<213> artificial sequence 
<220> 

<223> 44-164 + EGF (serotype d) sequence 



<400> 135 
atgggaggtt 


ggtcttccaa 


acctcggaaa ggcatgggga 


cgaatctttc 


tgttcccaat 


60 


cctctgggat 


tctttcccga 


tcaccagttg 


gaccctgcgt 


tcggagccaa 


ctcaaacaat 


120 


ccagattggg 


gcggccgcat 


gaactctgat 


tccgaatgcc 


cgctgtctca 


tgacggttac 


180 


tgcctgcatg 


atggcgtatg 


catgtacatc 


gaagctctgg 


acaaatacgc 


atgcaactgt 


240 


gttgtaggtt 


acatcggcga 


acgttgccag 


tatcgcgacc 


tgaaatggtg 


ggaactgcgt 


300 


aaggcggccg 


cccctgcacc 


gaacatggag 


aacacaacat 


caggattcct 


aggacccctg 


360 


ctcgtgttac 


aggcggggtt 


tttcttgttg 


acaagaatcc 


tcacaatacc 


acagagtcta 


420 


gactcgtggt 


ggacttctct 


caattttcta 


gggggagcac 


ccacgtgtcc 


tggccaaaat 


480 


tcgcagtccc 


caacctccaa 


tcactcacca 


acctcttgtc 


ctccaatttg 


tcctggctat 


540 


cgctggatgt 


gtctgcggcg 


ttttatcata 


ttcctcttca 


tcctgctgct 


atgcctcatc 


600 


ttcttgttgg 


ttcttctgga 


ctaccaaggt 


atgttgcccg 


tttgtcctct 


acttccagga 


660 


acatcaacca 


ccagcacggg 


gccatgcaag 


acctgcacga 


ttcctgctca 


aggaacctct 


720 


atgtttccct 


cttgttgctg 


tacaaaacct 


tcggacggaa 


actgcacttg 


tattcccatc 


780 


ccatcatcct 


gggctttcgc 


aagattccta 


tgggagtggg 


cctcagtccg 


tttctcctgg 


840 


ctcagtttac 


tagtgccatt 


tgttcagtgg 


ttcgtagggc 


tttcccccac 


tgtttggctt 


900 


tcagttatat 


ggatgatgtg 


gtattggggg 


ccaagtctgt acaacatctt 
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gagtcccttt 


960 



♦ 
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ttacctctat taccaatttt cttttgtctt tgggtataca tt 1002 

<210> 136 

<211> 334 

<212> PRT 

<213> artificial sequence 
<220> 

<223> Proteins corresponding to 44-164 + EGF (serotype d) sequence 

<400> 136 

Met Gly Gly Trp ser Ser Lys Pro Arg Lys Gly Met Gly Thr Asn Leu 
1 5 10 15 

Ser val Pro Asn Pro Leu Gly Phe Phe Pro Asp His Gin Leu Asp Pro 
20 25 30 

Ala Phe Gly Ala Asn Ser Asn Asn pro Asp Trp Gly Gly Arg Met Asn 
35 40 45 

Ser Asp Ser Glu Cys Pro Leu Ser His Asp Gly Tyr Cys Leu His Asp 
50 55 60 

Gly val Cys Met Tyr lie Glu Ala Leu Asp Lys Tyr Ala Cys Asn cys 
65 70 75 80 

val Val Gly Tyr lie Gly Glu Arg Cys Gin Tyr Arg Asp Leu Lys Trp 
85 90 ~ 95 

Trp Glu Leu Arg Lys Ala Ala Ala Pro Ala Pro Asn Met Glu Asn Thr 
100 105 110 

Thr Ser Gly Phe Leu Gly Pro Leu Leu val Leu Gin Ala Gly Phe Phe 
115 120 125 

Leu Leu Thr Arg lie Leu Thr lie Pro Gin Ser Leu Asp Ser Trp Trp 
130 135 140 

Thr Ser Leu Asn Phe Leu Gly Gly Ala Pro Thr Cys Pro Gly Gin Asn 
145 150 155 160 

Ser Gin Ser Pro Thr Ser Asn His Ser Pro Thr Ser Cys Pro Pro lie 
165 170 175 

Cys Pro Gly Tyr Arg Trp Met Cys Leu Arg Arg Phe lie lie Phe Leu 
180 " 185 ~ 190 

Phe lie Leu Leu Leu Cys Leu lie Phe Leu Leu Val Leu Leu Asp Tyr 
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195 200 205 

Gin Gly Met Leu Pro val Cys Pro Leu Leu Pro Gly Thr Ser Thr Thr 

210 215 220 

Ser Thr Gly Pro Cys Lys Thr cys Thr lie Pro Ala Gin Gly Thr Ser 

225 230 235 240 

Met Phe Pro Ser cys Cys Cys Thr Lys Pro Ser Asp Gly Asn Cys Thr 

245 250 255 

Cys lie Pro lie Pro Ser Ser Trp Ala Phe Ala Arg Phe Leu Trp Glu 

260 265 " 270 

Trp Ala Ser val Arg Phe Ser Trp Leu Ser Leu Leu val Pro Phe val 

275 280 285 

Gin Trp Phe val Gly Leu Ser Pro Thr val Trp Leu Ser val lie Trp 

290 295 300 

Met Met Trp Tyr Trp Gly Pro Ser Leu Tyr Asn lie Leu Ser Pro Phe 

305 310 315 320 

Leu Pro Leu Leu Pro lie Phe Phe Cys Leu Trp val Tyr lie 
325 330 



<210> 137 
<211> 1002 
<212> DNA 

<213> artificial sequence 










<220> 

<223> 44-164 (Q129R) 


+ EGF (serotype d) sequence 






<400> 137 
atgggaggtt 


ggtcttccaa 


acctcggaaa 


ggcatgggga 


cgaatctttc 


tgttcccaat 


60 


cctctgggat 


tctttcccga 


tcaccagttg 


gaccctgcgt 


tcggagccaa 


ctcaaacaat 


120 


ccagattggg 


gcggccgcat 


gaactctgat 


tccgaatgcc 


cgctgtctca 


tgacggttac 


180 


tgcctgcatg 


atggcgtatg 


catgtacatc 


gaagctctgg 


acaaatacgc 


atgcaactgt 


240 


gttgtaggtt 


acatcggcga 


acgttgccag 


tatcgcgacc 


tgaaatggtg 


ggaactgcgt 


300 


aaggcggccg 


cccctgcacc 


gaacatggag 


aacacaacat 


caggattcct 


aggacccctg 


360 


ctcgtgttac 


aggcggggtt 


tttcttgttg 


acaagaatcc 


tcacaatacc 


acagagtcta 


420 


gactcgtggt 


ggacttctct 


caattttcta 


gggggagcac 


ccacgtgtcc 


tggccaaaat 


480 


tcgcagtccc 


caacctccaa 


tcactcacca 


acctcttgtc 


ctccaatttg 


tcctggctat 


540 


cgctggatgt 


gtctgcggcg 


ttttatcata 


ttcctcttca 


tcctgctgct 


atgcctcatc 


600 
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ttcttgttgg ttcttctgga ctaccaaggt atgttgcccg tttgtcctct acttccagga 660 

acatcaacca ccagcacggg gccatgcaag acctgcacga ttcctgctcg aggaacctct 720 

atgtttccct cttgttgctg tacaaaacct tcggacggaa actgcacttg tattcccatc 780 

ccatcatcct gggctttcgc aagattccta tgggagtggg cctcagtccg tttctcctgg 840 

ctcagtttac tagtgccatt tgttcagtgg ttcgtagggc tttcccccac tgtttggctt 900 

tcagttatat ggatgatgtg gtattggggg ccaagtctgt acaacatctt gagtcccttt 960 

ttacctctat taccaatttt cttttgtctt tgggtataca tt 1002 

<210> 138 

<211> 334 

<212> PRT 

<213> artificial sequence 
<220> 

<223> Proteins corresponding to 44-164 (Q129R) + EGF (serotype d) 
sequence 

<400> 138 

Met Gly Gly Trp ser Ser Lys Pro Arg Lys Gly Met Gly Thr Asn Leu 
1 5 10 15 

Ser val Pro Asn Pro Leu Gly Phe Phe Pro Asp His Gin Leu Asp Pro 
20 25 30 

Ala Phe Gly Ala Asn Ser Asn Asn Pro Asp Trp Gly Gly Arg Met Asn 
35 40 45 

Ser Asp ser Glu Cys Pro Leu Ser His Asp Gly Tyr Cys Leu His Asp 
50 55 60 

Gly val Cys Met Tyr lie Glu Ala Leu Asp Lys Tyr Ala cys Asn cys 
65 70 75 80 

Val Val Gly Tyr lie Gly Glu Arg Cys Gin Tyr Arg Asp Leu Lys Trp 
85 90 95 

Trp Glu Leu Arg Lys Ala Ala Ala Pro Ala Pro Asn Met Glu Asn Thr 
100 105 110 

Thr Ser Gly Phe Leu Gly Pro Leu Leu Val Leu Gin Ala Gly Phe Phe 
115 120 125 

Leu Leu Thr Arg lie Leu Thr lie Pro Gin Ser Leu Asp Ser Trp Trp 
130 135 140 
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Thr Ser Leu Asn phe Leu Gly Gly Ala Pro Thr Cys Pro Gly Gin Asn 
145 150 155 160 

Ser Gin Ser Pro Thr Ser Asn His Ser Pro Thr ser cys Pro Pro lie 
165 170 175 

Cys Pro Gly Tyr Arg Trp Met Cys Leu Arg Arg Phe lie lie Phe Leu 
180 ~ 185 ~ 190 

Phe lie Leu Leu Leu Cys Leu He Phe Leu Leu val Leu Leu Asp Tyr 
195 200 205 

Gin Gly Met Leu Pro Val Cys Pro Leu Leu Pro Gly Thr Ser Thr Thr 
210 215 220 

Ser Thr Gly Pro Cys Lys Thr cys Thr lie Pro Ala Arg Gly Thr Ser 
225 230 235 240 

Met Phe Pro Ser Cys Cys Cys Thr Lys Pro Ser Asp Gly Asn Cys Thr 
245 250 255 

Cys lie Pro lie Pro Ser Ser Trp Ala Phe Ala Arg Phe Leu Trp Glu 
260 265 " 270 

Trp Ala Ser Val Arg Phe Ser Trp Leu Ser Leu Leu Val Pro Phe val 
275 280 285 

Gin Trp Phe Val Gly Leu Ser Pro Thr Val Trp Leu Ser val lie Trp 
290 295 300 

Met Met Trp Tyr Trp Gly Pro Ser Leu Tyr Asn He Leu Ser Pro Phe 
305 310 315 320 

Leu Pro Leu Leu Pro He Phe Phe Cys Leu Trp Val Tyr lie 
325 330 

<210> 139 
<211> 1002 
<212> DNA 

<213> artificial sequence 
<220> 

<223> 44-164 (G145R) + EGF (serotype d) sequence 
<400> 139 

atgggaggtt ggtcttccaa acctcggaaa ggcatgggga cgaatctttc tgttcccaat 60 
cctctgggat tctttcccga tcaccagttg gaccctgcgt tcggagccaa ctcaaacaat 120 
ccagattggg gcggccgcat gaactctgat tccgaatgcc cgctgtctca tgacggttac 180 
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tgcctgcatg 


atggcgtatg 


catgtacatc 


gaagctctgg 


acaaatacgc 


atgcaactgt 


240 


gttgtaggtt 


acatcggcga 


acgttgccag 


tatcgcgacc 


tgaaatggtg 


ggaactgcgt 


300 


aaggcggccg 


cccctgcacc 


gaacatggag 


aacacaacat 


caggattcct 


aggacccctg 


360 


ctcgtgttac 


aggcggggtt 


tttcttgttg 


acaagaatcc 


tcacaatacc 


acagagtcta 


420 


gactcgtggt 


ggacttctct 


caattttcta 


gggggagcac 


ccacgtgtcc 


tggccaaaat 


480 


tcgcagtccc 


caacctccaa 


tcactcacca 


acctcttgtc 


ctccaatttg 


tcctggctat 


540 


cgctggatgt 


gtctgcggcg 


ttttatcata 


ttcctcttca 


tcctgctgct 


atgcctcatc 


600 


ttcttgttgg 


ttcttctgga 


ctaccaaggt 


atgttgcccg 


tttgtcctct 


acttccagga 


660 


acatcaacca 


ccagcacggg 


gccatgcaag 


acctgcacga 


ttcctgctca 


aggaacctct 


720 


atgtttccct 


cttgttgctg 


tacaaaacct 


tcggacagaa 


actgcacttg 


tattcccatc 


780 


ccatcatcct 


gggctttcgc 


aagattccta 


tgggagtggg 


cctcagtccg 


tttctcctgg 


840 


ctcagtttac 


tagtgccatt 


tgttcagtgg 


ttcgtagggc 


tttcccccac 


tgtttggctt 


900 


tcagttatat 


ggatgatgtg 


gtattggggg 


ccaagtctgt 


acaacatctt 


gagtcccttt 


960 


ttacctctat 


taccaatttt 


cttttgtctt 


tgggtataca 


tt 




1002 



<210> 140 

<211> 334 

<212> PRT 

<213> artificial sequence 
<220> 

<223> Proteins corresponding to 44-164 (G145R) + EGF (serotype d) 
sequence 

<400> 140 

Met Gly Gly Trp Ser Ser Lys Pro Arg Lys Gly Met Gly Thr Asn Leu 
1 5 10 15 

Ser Val Pro Asn Pro Leu Gly Phe Phe Pro Asp His Gin Leu Asp Pro 
20 25 30 



Ala Phe Gly Ala Asn Ser Asn Asn Pro Asp Trp Gly Gly Arg Met Asn 
35 40 45 



Ser Asp Ser Glu Cys Pro Leu Ser His Asp Gly Tyr Cys Leu His Asp 
50 55 60 



Gly Val Cys Met Tyr lie Glu Ala Leu Asp Lys Tyr Ala Cys Asn cys 
65 70 75 80 

val val Gly Tyr lie Gly Glu Arg cys Gin Tyr Arg Asp Leu Lys Trp 
85 ' 90 95 
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~rrp Glu Leu Arg Lys Ala Ala Ala Pro Ala Pro Asn Met Glu Asn Thr 
100 105 110 

Thr Ser Gly Phe Leu Gly Pro Leu Leu val Leu Gin Ala Gly Phe Phe 
115 120 125 

Leu Leu Thr Arg lie Leu Thr lie Pro Gin Ser Leu Asp Ser Trp Trp 
130 135 140 

Thr ser Leu Asn Phe Leu Gly Gly Ala Pro Thr cys Pro Gly Gin Asn 
145 150 " 155 160 

Ser Gin ser Pro Thr Ser Asn His Ser Pro Thr ser cys Pro Pro lie 
165 170 175 

Cys Pro Gly Tyr Arg Trp Met Cys Leu Arg Arg Phe lie lie Phe Leu 
180 " 185 190 

Phe lie Leu Leu Leu Cys Leu He Phe Leu Leu val Leu Leu Asp Tyr 
195 200 205 

Gin Gly Met Leu Pro val Cys Pro Leu Leu Pro Gly Thr ser Thr Thr 
210 215 220 

Ser Thr Gly Pro Cys Lys Thr Cys Thr lie Pro Ala Gin Gly Thr Ser 
225 230 235 240 

Met Phe Pro Ser cys Cys Cys Thr Lys Pro Ser Asp Arg Asn Cys Thr 
245 250 " 255 

Cys lie Pro lie Pro Ser Ser Trp Ala Phe Ala Arg Phe Leu Trp Glu 
260 265 270 

Trp Ala Ser val Arg Phe Ser Trp Leu Ser Leu Leu Val Pro Phe Val 
275 280 285 

Gin Trp Phe val Gly Leu Ser Pro Thr val Trp Leu ser val lie Trp 
290 295 300 

Met Met Trp Tyr Trp Gly Pro Ser Leu Tyr Asn lie Leu Ser Pro Phe 
305 310 315 320 

Leu Pro Leu Leu Pro lie Phe Phe Cys Leu Trp val Tyr lie 
325 330 

<210> 141 
<211> 1002 
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<212> DNA 

<213> artificial sequence 
<220> 

<223> 44-164 (Q129R/G145R) + EGF (serotype d) sequence 



<400> 141 
atgggaggtt 


ggtcttccaa 


acctcggaaa 


ggcatgggga 


cgaatctttc tgttcccaat 


60 


cctctgggat 


tctttcccga 


tcaccagttg 


gaccctgcgt 


tcggagccaa ctcaaacaat 


120 


ccagattggg 


gcggccgcat 


gaactctgat 


tccgaatgcc 


cgctgtctca tgacggttac 


180 


tgcctgcatg 


atggcgtatg 


catgtacatc 


gaagctctgg 


acaaatacgc atgcaactgt 


240 


gttgtaggtt 


acatcggcga 


acgttgccag 


tatcgcgacc 


tgaaatggtg ggaactgcgt 


300 


aaggcggccg 


cccctgcacc 


gaacatggag 


aacacaacat 


caggattcct aggacccctg 


360 


ctcgtgttac 


aggcggggtt 


tttcttgttg 


acaagaatcc 


tcacaatacc acagagtcta 


420 


gactcgtggt 


ggacttctct 


caattttcta 


gggggagcac 


ccacgtgtcc tggccaaaat 


480 


tcgcagtccc 


caacctccaa 


tcactcacca 


acctcttgtc 


ctccaatttg tcctggctat 


540 


cgctggatgt 


gtctgcggcg 


ttttatcata 


ttcctcttca 


tcctgctgct atgcctcatc 


600 


ttcttgttgg 


ttcttctgga 


ctaccaaggt 


atgttgcccg 


tttgtcctct acttccagga 


660 


acatcaacca 


ccagcacggg 


gccatgcaag 


acctgcacga 


ttcctgctcg aggaacctct 


720 


atgtttccct 


cttgttgctg 


tacaaaacct 


tcggacagaa 


actgcacttg tattcccatc 


780 


ccatcatcct 


gggctttcgc 


aagattccta 


tgggagtggg 


cctcagtccg tttctcctgg 


840 


ctcagtttac 


tagtgccatt 


tgttcagtgg 


ttcgtagggc 


tttcccccac tgtttggctt 


900 


tcagttatat 


ggatgatgtg 


gtattggggg 


ccaagtctgt 


acaacatctt gagtcccttt 


960 


ttacctctat 


taccaatttt 


cttttgtctt 


tgggtataca 


tt 


1002 



<210> 142 

<211> 334 

<212> PRT 

<213> arti f i ci al sequence 
<220> 

<223> Proteins corresponding to 44-164 (Q129R/G145R) + EGF (serotype d) 
sequence 

<400> 142 

Met Gly Gly Trp Ser Ser Lys Pro Arg Lys Gly Met Gly Thr Asn Leu 
1 5 10 15 

Ser Val Pro Asn Pro Leu Gly Phe Phe Pro Asp His Gin Leu Asp Pro 
20 25 30 



Ala Phe Gly Ala Asn Ser Asn Asn Pro Asp Trp Gly Gly Arg Met Asn 
35 40 45 
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Ser Asp Ser Glu Cys Pro Leu Ser His Asp Gly Tyr Cys Leu His Asp 
50 55 60 

Gly Val Cys Met Tyr lie Glu Ala Leu Asp Lys Tyr Ala Cys Asn cys 
65 70 75 80 

Val val Gly Tyr lie Gly Glu Arg Cys Gin Tyr Arg Asp Leu Lys Trp 
85 90 " 95 

Trp Glu Leu Arg Lys Ala Ala Ala Pro Ala Pro Asn Met Glu Asn Thr 
100 105 110 

Thr Ser Gly Phe Leu Gly Pro Leu Leu val Leu Gin Ala Gly Phe Phe 
115 120 125 

Leu Leu Thr Arg lie Leu Thr He Pro Gin ser Leu Asp Ser Trp Trp 
130 135 140 

Thr Ser Leu Asn Phe Leu Gly Gly Ala Pro Thr Cys Pro Gly Gin Asn 
145 150 155 160 

Ser Gin Ser Pro Thr Ser Asn His Ser Pro Thr Ser Cys Pro Pro lie 
165 170 175 

Cys Pro Gly Tyr Arg Trp Met Cys Leu Arg Arg Phe lie lie Phe Leu 
180 185 " 190 

Phe lie Leu Leu Leu Cys Leu lie Phe Leu Leu val Leu Leu Asp Tyr 
195 200 205 

Gin Gly Met Leu Pro Val Cys Pro Leu Leu Pro Gly Thr Ser Thr Thr 
210 215 220 

Ser Thr Gly Pro Cys Lys Thr Cys Thr lie Pro Ala Arg Gly Thr Ser 
225 230 235 ~ 240 

Met Phe Pro Ser Cys Cys Cys Thr Lys Pro Ser Asp Arg Asn cys Thr 
245 250 ~ 255 

Cys lie Pro lie Pro Ser Ser Trp Ala Phe Ala Arg Phe Leu Trp Glu 
260 265 " 270 

Trp Ala Ser Val Arg Phe Ser Trp Leu Ser Leu Leu val Pro Phe Val 
275 280 285 

Gin Trp Phe Val Gly Leu Ser Pro Thr Val Trp Leu Ser val lie Trp 
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290 295 300 



Met Met Trp Tyr Trp Gly Pro Ser Leu Tyr Asn lie Leu Ser Pro Phe 
305 310 315 320 



Leu Pro Leu Leu Pro lie Phe Phe cys Leu Trp val Tyr lie 
325 330 



<210> 143 

<211> 861 

<212> DNA 

<213> artificial sequence 
<220> 

<223> 44-164 + STR-tag (serotype d) sequence 



<400> 143 
atgggaggtt 


ggtcttccaa 


acctcggaaa 


ggcatgggga 


cgaatctttc 


tgttcccaat 


60 


rrtrtnnnat 
v- v. ll Lyyya l 


LL L L LL,LL,ya 


trarranttfi 
LLdLLdy L Ly 




trnnanrraa 

LL.yyd.yn_d.ci 


rtraaaraat 
LLLaddCddL 




ccagattggg 


gcggccgctg 


gagccacccg 


cagttcgaaa 


aagcggccgc 


ccctgcaccg 


180 


aacatggaga 


acacaacatc 


aggattccta 


ggacccctgc 


tcgtgttaca 


ggcggggttt 


240 


ttcttgttga 


caagaatcct 


cacaatacca 


cagagtctag 


actcgtggtg 


gacttctctc 


300 


aattttctag 


ggggagcacc 


cacgtgtcct 


ggccaaaatt 


cgcagtcccc 


aacctccaat 


360 


cactcaccaa 


cctcttgtcc 


tccaatttgt 


cctggctatc 


gctggatgtg 


tctgcggcgt 


420 


tttatcatat 


tcctcttcat 


cctgctgcta 


tgcctcatct 


tcttgttggt 


tcttctggac 


480 


taccaaggta 


tgttgcccgt 


ttgtcctcta 


cttccaggaa 


catcaaccac 


cagcacgggg 


540 


ccatgcaaga 


cctgcacgat 


tcctgctcaa 


ggaacctcta 


tgtttccctc 


ttgttgctgt 


600 


acaaaacctt 


cggacggaaa 


ctgcacttgt 


attcccatcc 


catcatcctg 


ggctttcgca 


660 


agattcctat 


gggagtgggc 


ctcagtccgt 


ttctcctggc 


tcagtttact 


agtgccattt 


720 


cjttcagtggt 


tcgtagggct 


ttcccccact 


gtttggcttt 


cagttatatg 


gatgatgtgg 


780 


tattgggggc 


caagtctgta 


caacatcttg 


agtccctttt 


tacctctatt 


accaattttc 


840 


ttttgtcttt 


gggtatacat 


t 








861 



<:210> 144 
<211> 287 
<212> PRT 

<213> artificial sequence 
<220> 

<223> Proteins corresponding to 44-164 + STR-tag (serotype d) sequence 
<400> 144 

Met Gly Gly Trp Ser Ser Lys Pro Arg Lys Gly Met Gly Thr Asn Leu 
1 5 10 ' 15 
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Ser Val Pro Asn Pro Leu Gly Phe Phe Pro Asp His Gin Leu Asp Pro 
20 25 30 

Ala Phe Gly Ala Asn Ser Asn Asn Pro Asp Trp Gly Gly Arg Trp Ser 
35 40 45 

His Pro Gin Phe Glu Lys Ala Ala Ala Pro Ala Pro Asn Met Glu Asn 
50 55 60 

Thr Thr Ser Gly Phe Leu Gly Pro Leu Leu Val Leu Gin Ala Gly Phe 
65 70 75 80 

Phe Leu Leu Thr Arg lie Leu Thr lie Pro Gin Ser Leu Asp Ser Trp 
85 90 95 

Trp Thr Ser Leu Asn Phe Leu Gly Gly Ala Pro Thr Cys Pro Gly Gin 
100 105 110 

Asn ser Gin Ser Pro Thr Ser Asn His ser Pro Thr ser cys Pro Pro 
115 120 125 

lie Cys Pro Gly Tyr Arg Trp Met Cys Leu Arg Arg Phe lie lie Phe 
130 135 140 

Leu Phe lie Leu Leu Leu Cys Leu lie Phe Leu Leu Val Leu Leu Asp 
145 150 155 160 

Tyr Gin Gly Met Leu Pro Val Cys Pro Leu Leu Pro Gly Thr Ser Thr 
165 * 170 175 

Thr Ser Thr Gly Pro Cys Lys Thr Cys Thr lie Pro Ala Gin Gly Thr 
180 185 190 

Ser Met Phe Pro Ser Cys Cys Cys Thr Lys Pro Ser Asp Gly Asn Cys 
195 200 205 

Thr Cys lie Pro lie Pro Ser Ser Trp Ala Phe Ala Arg Phe Leu Trp 
210 215 220 

Glu Trp Ala Ser Val Arg Phe Ser Trp Leu Ser Leu Leu Val Pro Phe 
225 230 235 240 

Val Gin Trp Phe Val Gly Leu Ser Pro Thr Val Trp Leu Ser val lie 
245 250 255 

Trp Met Met Trp Tyr Trp Gly Pro Ser Leu Tyr Asn lie Leu Ser Pro 
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260 265 270 



Phe Leu Pro Leu Leu Pro lie Phe Phe Cys Leu Trp Val Tyr lie 
275 280 285 



<210> 145 

<211> 861 

<212> DNA 

<213> artificial sequence 



<220> 

<223> 44-164 (Q129R) + STR-tag (serotype d) sequence 



<400> 145 
atgggaggtt 


ggtcttccaa 


acctcggaaa 


ggcatgggga 


cgaatctttc 


tgttcccaat 


60 


cctctgggat 


tctttcccga 


tcaccagttg 


gaccctgcgt 


tcggagccaa 


ctcaaacaat 


120 


ccagattggg 


gcggccgctg 


gagccacccg 


cagttcgaaa 


aagcggccgc 


ccctgcaccg 


180 


clcl^cl i_ y y a. y ci 


CI l_ CL V_ CL CL l~ Cl LL 








yy<-ygyy 




ttcttgttga 


caagaatcct 


cacaatacca 


cagagtctag 


actcgtggtg 


gacttctctc 


300 


aattttctag 


ggggagcacc 


cacgtgtcct 


ggccaaaatt 


cgcagtcccc 


aacctccaat 


360 


cactcaccaa 


cctcttgtcc 


tccaatttgt 


cctggctatc 


gctggatgtg 


tctgcggcgt 


420 


tttatcatat 


tcctcttcat 


cctgctgcta 


tgcctcatct 


tcttgttggt 


tcttctggac 


480 


taccaaggta 


tgttgcccgt 


ttgtcctcta 


cttccaggaa 


catcaaccac 


cagcacgggg 


540 


ccatgcaaga 


cctgcacgat 


tcctgctcga 


ggaacctcta 


tgtttccctc 


ttgttgctgt 


600 


acaaaacctt 


cggacggaaa 


ctgcacttgt 


attcccatcc 


catcatcctg 


ggctttcgca 


660 


agattcctat 


gggagtgggc 


ctcagtccgt 


ttctcctggc 


tcagtttact 


agtgccattt 


720 


gttcagtggt 


tcgtagggct 


ttcccccact 


gtttggcttt 


cagttatatg 


gatgatgtgg 


780 


tattgggggc 


caagtctgta 


caacatcttg 


agtccctttt 


tacctctatt 


accaattttc 


840 


ttttgtcttt 


gggtatacat 


t 








861 



<210> 146 

<211> 287 

<212> PRT 

<213> artificial sequence 
<220> 

<223> Proteins correspnding to 44-164 (Q129R) + STR-tag (serotype d) 
sequence 

<400> 146 

Met Gly Gly Trp ser Ser Lys Pro Arg Lys Gly Met Gly Thr Asn Leu 
1 5 10 15 



Ser val Pro Asn Pro Leu Gly Phe Phe Pro Asp His Gin Leu Asp Pro 
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20 25 - 30 

Ala Phe Gly Ala Asn Ser Asn Asn Pro Asp Trp Gly Gly Arg Trp Ser 
35 40 45 

His Pro Gin Phe Glu Lys Ala Ala Ala Pro Ala Pro Asn Met Glu Asn 
50 55 60 

Thr Thr Ser Gly Phe Leu Gly Pro Leu Leu val Leu Gin Ala Gly Phe 
65 70 75 80 

Phe Leu Leu Thr Arg lie Leu Thr lie Pro Gin Ser Leu Asp Ser Trp 
85 90 95 

Trp Thr Ser Leu Asn Phe Leu Gly Gly Ala Pro Thr Cys Pro Gly Gin 
100 105 110 

Asn Ser Gin Ser Pro Thr Ser Asn His Ser Pro Thr ser Cys Pro Pro 
115 120 125 

lie Cys Pro Gly Tyr Arg Trp Met Cys Leu Arg Arg Phe lie lie Phe 
130 135 140 

Leu Phe lie Leu Leu Leu Cys Leu lie Phe Leu Leu val Leu Leu Asp 
145 150 155 160 

Tyr Gin Gly Met Leu Pro Val cys Pro Leu Leu Pro Gly Thr Ser Thr 
165 170 175 

Thr Ser Thr Gly Pro Cys Lys Thr Cys Thr lie Pro Ala Arg Gly Thr 
180 185 190 

Ser Met Phe Pro Ser cys Cys Cys Thr Lys Pro Ser Asp Gly Asn Cys 
195 200 205 

Thr Cys lie Pro lie Pro Ser Ser Trp Ala Phe Ala Arg Phe Leu Trp 
210 215 220 

Glu Trp Ala Ser val Arg Phe Ser Trp Leu Ser Leu Leu val Pro Phe 
225 230 235 240 

val Gin Trp Phe Val Gly Leu Ser Pro Thr Val Trp Leu Ser val lie 
245 250 255 

Trp Met Met Trp Tyr Trp Gly Pro Ser Leu Tyr Asn lie Leu ser Pro 
260 265 270 
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Phe Leu Pro Leu Leu Pro lie Phe Phe Cys Leu Trp val Tyr lie 
275 280 285 



<210> 147 

<211> 861 

<212> DNA 

<213> artificial sequence 
<220> 

<223> 44-164 (G145R) + STR-tag (serotype d) sequence 



<400> 147 
atgggaggtt 


ggtcttccaa 


acctcggaaa 


ggcatgggga 


cgaatctttc 


tgttcccaat 


60 


cctctgggat 


tctttcccga 


tcaccagttg 


gaccctgcgt 


tcggagccaa 


ctcaaacaat 


120 


ccagattggg 


gcggccgctg 


gagccacccg 


cagttcgaaa 


aagcggccgc 


ccctgcaccg 


180 


aacatggaga 


acacaacatc 


aggattccta 


ggacccctgc 


tcgtgttaca 


ggcggggttt 


240 


ttcttgttga 


caagaatcct 


cacaatacca 


cagagtctag 


actcgtggtg 


gacttctctc 


300 


aattttctag 


ggggagcacc 


cacgtgtcct 


ggccaaaatt 


cgcagtcccc 


aacctccaat 


360 


cactcaccaa 


cctcttgtcc 


tccaatttgt 


cctggctatc 


gctggatgtg 


tctgcggcgt 


420 


tttatcatat 


tcctcttcat 


cctgctgcta 


tgcctcatct 


tcttgttggt 


tcttctggac 


480 


taccaaggta 


tgttgcccgt 


ttgtcctcta 


cttccaggaa 


catcaaccac 


cagcacgggg 


540 


ccatgcaaga 


cctgcacgat 


tcctgctcaa 


ggaacctcta 


tgtttccctc 


ttgttgctgt 


600 


acaaaacctt 


cggacagaaa 


ctgcacttgt 


attcccatcc 


catcatcctg 


ggctttcgca 


660 


agattcctat 


gggagtgggc 


ctcagtccgt 


ttctcctggc 


tcagtttact 


agtgccattt 


720 


gttcagtggt 


tcgtagggct 


ttcccccact 


gtttggcttt 


cagttatatg 


gatgatgtgg 


780 


tattgggggc 


caagtctgta 


caacatcttg 


agtccctttt 


tacctctatt 


accaattttc 


840 


ttttgtcttt 


gggtatacat 


t 








861 



<210> 148 

<211> 286 

<212> PRT 

<213> artificial sequence 
<220> 

<223> Proteins corresponding to 44-164 (G145R) + STR-tag (serotype d) 
sequence 

<400> 148 

Met Gly Gly Trp Ser ser Lys Pro Arg Lys Gly Met Gly Thr Asn Leu 
1 5 10 15 



Ser val Pro Asn Pro Leu Gly Phe Phe Pro Asp His Gin Leu Asp Pro 
20 25 30 
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Ala Phe Gly Ala Asn ser Asn Asn Pro Asp Trp Gly Gly Arg Trp Ser 
35 40 45 

His Pro Gin Phe Glu Lys Ala Ala Ala Pro Ala Pro Asn Met Glu Asn 
50 55 60 

Thr Thr Ser Gly Phe Leu Gly Pro Leu Leu Val Leu Gin Ala Gly Phe 
65 70 75 80 

Phe Leu Leu Thr Arg lie Leu Thr lie Pro Gin Ser Leu Asp Ser Trp 
85 90 95 

Trp Thr Ser Leu Asn Phe Leu Gly Gly Ala Pro Thr Cys Pro Gly Gin 
100 105 110 

Asn Ser Gin Ser Pro Thr Ser Asn His Ser Pro Thr ser Cys Pro Pro 
115 120 125 

lie Cys Pro Gly Tyr Arg Trp Met Cys Leu Arg Arg Phe lie lie Phe 
130 135 140 

Leu Phe lie Leu Leu Leu Cys Leu lie Phe Leu Leu val Leu Leu Asp 
145 150 155 160 

Tyr Gin Gly Met Leu Pro val Cys Pro Leu Leu Pro Gly Thr Ser Thr 
165 170 175 

Thr Ser Thr Gly Pro Cys Lys Thr cys Thr lie Pro Ala Gin Gly Thr 
180 185 190 

Ser Met Phe Pro Ser Cys Cys Cys Thr Lys Pro Ser Asp Arg Asn Cys 
195 200 205 

Thr Cys lie Pro lie Pro Ser ser Trp Ala Phe Ala Arg Phe Leu Trp 
210 215 220 

Glu Trp Ala Ser Val Arg Phe Ser Trp Leu Ser Leu Leu Val Pro Phe 
225 230 235 240 

val Gin Trp Phe val Gly Leu ser Pro Thr val Trp Leu Ser Val lie 
245 250 255 

Trp Met Met Trp Tyr Trp Gly Pro Ser Leu Tyr Asn lie Leu Ser Pro 
260 265 270 

Phe Leu Pro Leu Leu Pro lie Phe Phe Cys Leu Trp Val Tyr 
275 280 285 
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<210> 149 

<211> 861 

<212> DNA 

<213> artificial sequence 
<220> 

<223> 44-164 (Q129R/G145R) + STR-tag (serotype d) sequence 



<400> 149 
atgggaggtt 


ggtcttccaa 


acctcggaaa 


ggcatgggga 


cgaatctttc 


tgttcccaat 


60 


cctctgggat 


tctttcccga 


tcaccagttg 


gaccctgcgt 


tcggagccaa 


ctcaaacaat 


120 


ccagattggg 


gcggccgctg 


gagccacccg 


cagttcgaaa 


aagcggccgc 


ccctgcaccg 


180 


aacatggaga 


acacaacatc 


aggattccta 


ggacccctgc 


tcgtgttaca 


ggcggggttt 


240 


ttcttgttga 


caagaatcct 


cacaatacca 


cagagtctag 


actcgtggtg 


gacttctctc 


300 


aattttctag 


ggggagcacc 


cacgtgtcct 


ggccaaaatt 


cgcagtcccc 


aacctccaat 


360 


cactcaccaa 


cctcttatcc 


tccaatttat 






*-y*-yy *-y *- 


420 


tttatcatat 


tcctcttcat 


cctgctgcta 


tgcctcatct 


tcttgttggt 


tcttctggac 


480 


taccaaggta 


tgttgcccgt 


ttgtcctcta 


cttccaggaa 


catcaaccac 


cagcacgggg 


540 


ccatgcaaga 


cctgcacgat 


tcctgctcga 


ggaacctcta 


tgtttccctc 


ttgttgctgt 


600 


acaaaacctt 


cggacagaaa 


ctgcacttgt 


attcccatcc 


catcatcctg 


ggctttcgca 


660 


agattcctat 


gggagtgggc 


ctcagtccgt 


ttctcctggc 


tcagtttact 


agtgccattt 


720 


gttcagtggt 


tcgtagggct 


ttcccccact 


gtttggcttt 


cagttatatg 


gatgatgtgg 


780 


tattgggggc 


caagtctgta 


caacatcttg 


agtccctttt 


tacctctatt 


accaattttc 


840 


ttttgtcttt 


gggtatacat 


t 








861 



<210> 150 

<211> 287 

<212> PRT 

<213> arti f i ci al sequence 
<220> 

<223> Proteins corresponding to 44-164 (Q129R/G145R) + STR-tag 
(serotype d) sequence 

<400> 150 

Met Gly Gly Trp Ser ser Lys Pro Arg Lys Gly Met Gly Thr Asn Leu 
15 10 15 



Ser val Pro Asn Pro Leu Gly Phe Phe Pro Asp His Gin Leu Asp Pro 
20 25 30 



Ala Phe Gly Ala Asn Ser Asn Asn Pro Asp Trp Gly Gly Arg Trp ser 
35 40 45 
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His Pro Gin Phe Glu Lys Ala Ala Ala Pro Ala Pro Asn Met Glu Asn 
50 55 60 

Thr Thr Ser Gly Phe Leu Gly Pro Leu Leu val Leu Gin Ala Gly Phe 
65 70 75 80 

Phe Leu Leu Thr Arg lie Leu Thr lie Pro Gin Ser Leu Asp Ser Trp 
85 90 95 

Trp Thr ser Leu Asn Phe Leu Gly Gly Ala Pro Thr Cys Pro Gly Gin 
100 105 110 

Asn Ser Gin Ser Pro Thr Ser Asn His Ser Pro Thr ser Cys Pro Pro 
115 120 125 

lie Cys Pro Gly Tyr Arg Trp Met Cys Leu Arg Arg Phe lie lie Phe 
130 135 140 

Leu Phe lie Leu Leu Leu Cys Leu lie Phe Leu Leu val Leu Leu Asp 
145 150 155 160 

Tyr Gin Gly Met Leu Pro val Cys Pro Leu Leu Pro Gly Thr Ser Thr 
165 ' 170 175 

Thr Ser Thr Gly Pro Cys Lys Thr cys Thr lie Pro Ala Arg Gly Thr 
180 185 190 

Ser Met Phe Pro Ser Cys Cys Cys Thr Lys Pro Ser Asp Arg Asn Cys 
195 200 205 

Thr Cys lie Pro lie Pro Ser Ser Trp Ala Phe Ala Arg Phe Leu Trp 
210 215 220 

Glu Trp Ala Ser val Arg Phe Ser Trp Leu Ser Leu Leu val Pro Phe 
225 230 235 240 

val Gin Trp Phe Val Gly Leu Ser Pro Thr val Trp Leu Ser val lie 
245 250 255 

Trp Met Met Trp Tyr Trp Gly Pro Ser Leu Tyr Asn lie Leu Ser Pro 
260 265 270 

Phe Leu Pro Leu Leu Pro lie Phe Phe Cys Leu Trp val Tyr lie 
275 280 285 

<210> 151 
<211> 1254 
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<212> DNA 

<213> artificial sequence 
<220> 

<223> 61-164 + zz (serotype d) sequence 



<400> 151 
atgggaggtt 


ggtcttccaa 


acctcggaaa 


ggcatgggga 


cgaatctttc tgttcccaat 


60 


cctctgggat 


tctttcccga 


tcaccagttg 


gaccctgcgt 


tcggagccaa 


ctcaaacaat 


120 


ccagattggg 


acttcaaccc 


caacaaggat 


caatggccag 


aggcaaatca ggtaggagcg 


180 


ggcggccgcg 


cgcaacacga 


tgaagccgta 


gacaacaaat 


tcaacaaaga 


acaacaaaac 


240 


gcgttctatg 


agatcttaca 


tttacctaac 


ttaaacgaag 


aacaacgaaa 


cgccttcatc 


300 


caaagtttaa 


aagatgaccc 


aagccaaagc 


gctaaccttt 


tagcagaagc 


taaaaagcta 


360 


aatgatgctc 


aggcgccgaa 


agtagacaac 


aaattcaaca 


aagaacaaca 


aaacgcgttc 


420 


tatgagatct 


tacatttacc 


taacttaaac 


gaagaacaac 


gaaacgcctt 


catccaaagt 


480 


ttaaaagatg 


acccaagcca 


aagcgctaac 


cttttagcag 


aagctaaaaa gctaaatgat 


540 


gctcaggcgc 


cgaaagcggc 


cgcccctgca 


ccgaacatgg 


agaacacaac 


atcaggattc 


600 


ctaggacccc 


tgctcgtgtt 


acaggcgggg 


tttttcttgt 


tgacaagaat 


cctcacaata 


660 


ccacagagtc 


tagactcgtg 


gtggacttct 


ctcaattttc 


tagggggagc 


acccacgtgt 


720 


cctggccaaa 


attcgcagtc 


cccaacctcc 


aatcactcac 


caacctcttg 


tcctccaatt 


780 


tgtcctggct 


atcgctggat 


gtgtctgcgg 


cgttttatca 


tattcctctt 


catcctgctg 


840 


ctatgcctca 


tcttcttgtt 


ggttcttctg 


gactaccaag 


gtatgttgcc 


cgtttgtcct 


900 


ctacttccag 


gaacatcaac 


caccagcacg 


gggccatgca 


agacctgcac 


gattcctgct 


960 


caaggaacct 


ctatgtttcc 


ctcttgttgc 


tgtacaaaac 


cttcggacgg 


aaactgcact 


1020 


tgtattccca 


tcccatcatc 


ctgggctttc 


gcaagattcc 


tatgggagtg 


ggcctcagtc 


1080 


cgtttctcct 


ggctcagttt 


actagtgcca 


tttgttcagt 


ggttcgtagg 


gctttccccc 


1140 


actgtttggc 


tttcagttat 


atggatgatg 


tggtattggg 


ggccaagtct 


gtacaacatc 


1200 


ttgagtccct 


ttttacctct 


attaccaatt 


ttcttttgtc 


tttgggtata 


catt 


1254 



<210> 152 
<211> 418 
<212> PRT 

<213> artificial sequence 
<220> 

<223> Proteins corresponding to 61-164 + ZZ (serotype d) sequence 
<400> 152 

Met Gly Gly Trp Ser Ser Lys Pro Arg Lys Gly Met Gly Thr Asn Leu 
15 10 15 
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Ser val Pro Asn Pro Leu Gly Phe Phe Pro Asp His Gin Leu Asp Pro 
20 25 30 

Ala Phe Gly Ala Asn Ser Asn Asn Pro Asp Trp Asp Phe Asn Pro Asn 
35 40 45 

Lys Asp Gin Trp Pro Glu Ala Asn Gin val Gly Ala Gly Gly Arg Ala 
50 55 60 

Gin His Asp Glu Ala Val Asp Asn Lys Phe Asn Lys Glu Gin Gin Asn 
65 70 75 80 

Ala Phe Tyr Glu lie Leu His Leu Pro Asn Leu Asn Glu Glu Gin Arg 
85 90 95 

Asn Ala Phe lie Gin Ser Leu Lys Asp Asp Pro Ser Gin Ser Ala Asn 
100 105 110 

Leu Leu Ala Glu Ala Lys Lys Leu Asn Asp Ala Gin Ala Pro Lys Val 
115 120 125 

Asp Asn Lys Phe Asn Lys Glu Gin Gin Asn Ala Phe Tyr Glu lie Leu 
130 135 140 

His Leu Pro Asn Leu Asn Glu Glu Gin Arg Asn Ala Phe lie Gin ser 
145 150 155 160 

Leu Lys Asp Asp Pro Ser Gin Ser Ala Asn Leu Leu Ala Glu Ala Lys 
165 170 175 

Lys Leu Asn Asp Ala Gin Ala Pro Lys Ala Ala Ala Pro Ala Pro Asn 
180 185 190 

Met Glu Asn Thr Thr Ser Gly Phe Leu Gly Pro Leu Leu val Leu Gin 
195 200 205 

Ala Gly Phe Phe Leu Leu Thr Arg lie Leu Thr lie Pro Gin ser Leu 
210 215 220 

Asp Ser Trp Trp Thr Ser Leu Asn Phe Leu Gly Gly Ala Pro Thr Cys 
225 230 235 240 

Pro Gly Gin Asn Ser Gin Ser Pro Thr Ser Asn His Ser Pro Thr Ser 
245 250 255 

Cys Pro Pro lie Cys Pro Gly Tyr Arg Trp Met Cys Leu Arg Arg Phe 
260 265 270 
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lie lie Phe Leu Phe lie Leu Leu Leu Cys Leu lie Phe Leu Leu val 
275 280 285 

Leu Leu Asp Tyr Gin Gly Met Leu Pro val Cys Pro Leu Leu Pro Gly 
290 295 300 

Thr Ser Thr Thr Ser Thr Gly Pro Cys Lys Thr Cys Thr lie Pro Ala 
305 310 315 320 

Gin Gly Thr Ser Met Phe Pro Ser Cys Cys Cys Thr Lys Pro Ser Asp 
325 330 335 

Gly Asn Cys Thr cys lie Pro lie Pro Ser Ser Trp Ala Phe Ala Arg 
340 345 350 

Phe Leu Trp Glu Trp Ala ser val Arg Phe Ser Trp Leu Ser Leu Leu 
355 360 365 

Val Pro Phe Val Gin Trp Phe Val Gly Leu Ser Pro Thr val Trp Leu 
370 375 380 

Ser Val lie Trp Met Met Trp Tyr Trp Gly Pro Ser Leu Tyr Asn lie 
385 390 395 400 

Leu ser Pro Phe Leu Pro Leu Leu Pro lie Phe Phe Cys Leu Trp val 
405 410 415 

Tyr lie 



<210> 153 

<211> 1254 

<212> DNA 

<213> artificial sequence 
<220> 

<223> 61-164 (Q129R) + zz (serotype d) sequence 

<400> 153 



atgggaggtt 


ggtcttccaa 


acctcggaaa 


ggcatgggga 


cgaatctttc 


tgttcccaat 


60 


cctctgggat 


tctttcccga 


tcaccagttg 


gaccctgcgt 


tcggagccaa 


ctcaaacaat 


120 


ccagattggg 


acttcaaccc 


caacaaggat 


caatggccag 


aggcaaatca 


ggtaggagcg 


180 


ggcggccgcg 


cgcaacacga 


tgaagccgta 


gacaacaaat 


tcaacaaaga 


acaacaaaac 


240 


gcgttctatg 


agatcttaca 


tttacctaac 


ttaaacgaag 


aacaacgaaa 


cgccttcatc 


300 


caaagtttaa 


aagatgaccc 


aagccaaagc 


gctaaccttt 


tagcagaagc taaaaagcta 


360 
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aggcgccgaa 


sequence listing_ST25.txt 
agtagacaac aaattcaaca aagaacaaca 


aaacgcgttc 


420 


tatgagatct 


tacatttacc 


taacttaaac 


gaagaacaac 


gaaacgcctt 


catccaaagt 


480 


ttaaaagatg 


acccaagcca 


aagcgctaac 


cttttagcag 


aagctaaaaa 


gctaaatgat 


540 


gctcaggcgc 


cgaaagcggc 


cgcccctgca 


ccgaacatgg 


agaacacaac 


atcaggattc 


600 


ctaggacccc 


tgctcgtgtt 


acaggcgggg 


tttttcttgt 


tgacaagaat 


cctcacaata 


660 


ccacagagtc 


tagactcgtg 


gtggacttct 


ctcaattttc 


tagggggagc 


acccacgtgt 


720 


cctggccaaa 


attcgcagtc 


cccaacctcc 


aatcactcac 


caacctcttg 


tcctccaatt 


780 


tgtcctggct 


atcgctggat 


gtgtctgcgg 


cgttttatca 


tattcctctt 


catcctgctg 


840 


ctatgcctca 


tcttcttgtt 


ggttcttctg 


gactaccaag 


gtatgttgcc 


cgtttgtcct 


900 


ctacttccag 


gaacatcaac 


caccagcacg 


gggccatgca 


agacctgcac 


gattcctgct 


960 


cgaggaacct 


ctatgtttcc 


ctcttgttgc 


tgtacaaaac 


cttcggacgg 


aaactgcact 


1020 


tgtattccca 


tcccatcatc 


ctgggctttc 


gcaagattcc 


tatgggagtg 


aacctcaatc 


1080 


cgtttctcct 


ggctcagttt 


actagtgcca 


tttgttcagt 


ggttcgtagg 


gctttccccc 


1140 


actgtttggc 


tttcagttat 


atggatgatg 


tggtattggg 


ggccaagtct 


gtacaacatc 


1200 


ttgagtccct 


ttttacctct 


attaccaatt 


ttcttttgtc 


tttgggtata 


catt 


1254 



<210> 154 

<211> 418 

<212> PRT 

<213> artificial sequence 
<220> 

<223> Proteins corresponding to 61-164 (Q129R) + zz (serotype d) 
sequence 

<400> 154 

Met Gly Gly Trp Ser Ser Lys Pro Arg Lys Gly Met Gly Thr Asn Leu 
1 5 10 15 

Ser val Pro Asn Pro Leu Gly Phe Phe Pro Asp His Gin Leu Asp Pro 
20 25 30 

Ala Phe Gly Ala Asn Ser Asn Asn Pro Asp Trp Asp Phe Asn Pro Asn 
35 40 45 



Lys Asp Gin Trp Pro Glu Ala Asn Gin Val Gly Ala Gly Gly Arg Ala 
50 55 60 



Gin His Asp Glu Ala val Asp Asn Lys Phe Asn Lys Glu Gin Gin Asn 
65 70 75 80 



Ala Phe Tyr Glu lie Leu His Leu Pro Asn Leu Asn Glu Glu Gin Arg 
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85 90 95 

Asn Ala Phe lie Gin Ser Leu Lys Asp Asp Pro Ser Gin Ser Ala Asn 
100 105 110 

Leu Leu Ala Glu Ala Lys Lys Leu Asn Asp Ala Gin Ala Pro Lys Val 
115 120 125 

Asp Asn Lys Phe Asn Lys Glu Gin Gin Asn Ala Phe Tyr Glu lie Leu 
130 135 140 

His Leu Pro Asn Leu Asn Glu Glu Gin Arg Asn Ala Phe lie Gin Ser 
145 150 155 160 

Leu Lys Asp Asp Pro Ser Gin ser Ala Asn Leu Leu Ala Glu Ala Lys 
165 170 175 

Lys Leu Asn Asp Ala Gin Ala Pro Lys Ala Ala Ala Pro Ala Pro Asn 
180 185 190 

Met Glu Asn Thr Thr ser Gly Phe Leu Gly Pro Leu Leu val Leu Gin 
195 200 205 

Ala Gly Phe Phe Leu Leu Thr Arg lie Leu Thr lie Pro Gin Ser Leu 
210 215 220 

Asp Ser Trp Trp Thr Ser Leu Asn Phe Leu Gly Gly Ala Pro Thr Cys 
225 230 235 240 

Pro Gly Gin Asn ser Gin Ser Pro Thr ser Asn His Ser Pro Thr ser 
245 250 255 

Cys Pro Pro lie Cys Pro Gly Tyr Arg Trp Met Cys Leu Arg Arg Phe 
260 265 270 

lie lie Phe Leu Phe lie Leu Leu Leu Cys Leu lie Phe Leu Leu val 
275 280 285 

Leu Leu Asp Tyr Gin Gly Met Leu Pro val Cys Pro Leu Leu Pro Gly 
290 * 295 300 

Thr Ser Thr Thr Ser Thr Gly Pro Cys Lys Thr Cys Thr lie Pro Ala 
305 310 315 320 

Arg Gly Thr Ser Met Phe Pro Ser Cys Cys Cys Thr Lys Pro Ser Asp 
325 330 335 
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Gly Asn cys Thr Cys lie Pro lie Pro Ser Ser Trp Ala Phe Ala Arg 
340 345 350 



Phe Leu Trp Glu Trp Ala ser Val Arg Phe Ser Trp Leu Ser Leu Leu 
355 360 365 

Val Pro Phe Val Gin Trp Phe val Gly Leu Ser Pro Thr Val Trp Leu 
370 375 380 

Ser Val lie Trp Met Met Trp Tyr Trp Gly Pro Ser Leu Tyr Asn lie 
385 390 395 400 

Leu Ser Pro Phe Leu Pro Leu Leu Pro lie Phe Phe Cys Leu Trp Val 
405 410 415 



Tyr lie 



<210> 155 

<211> 1254 

<212> DNA 

<213> artificial sequence 
<220> 

<223> 61-164 (G145R) + ZZ (serotype d) sequence 



<400> 155 
atgggaggtt 


ggtcttccaa 


acctcggaaa 


ggcatgggga 


cgaatctttc 


tgttcccaat 


60 


cctctgggat 


tctttcccga 


tcaccagttg 


gaccctgcgt 


tcggagccaa 


ctcaaacaat 


120 


ccagattggg 


acttcaaccc 


caacaaggat 


caatggccag 


aggcaaatca 


ggtaggagcg 


180 


ggcggccgcg 


cgcaacacga 


tgaagccgta 


gacaacaaat 


tcaacaaaga 


acaacaaaac 


240 


gcgttctatg 


agatcttaca 


tttacctaac 


ttaaacgaag 


aacaacgaaa cgccttcatc 


300 


caaagtttaa 


aagatgaccc 


aagccaaagc 


gctaaccttt 


tagcagaagc 


taaaaagcta 


360 


aatgatgctc 


aggcgccgaa 


agtagacaac 


aaattcaaca 


aagaacaaca 


aaacgcgttc 


420 


tatgagatct 


tacatttacc 


taacttaaac 


gaagaacaac 


gaaacgcctt 


catccaaagt 


480 


ttaaaagatg 


acccaagcca 


aagcgctaac 


cttttagcag 


aagctaaaaa 


gctaaatgat 


540 


gctcaggcgc 


cgaaagcggc 


cgcccctgca 


ccgaacatgg 


agaacacaac 


atcaggattc 


600 


ctaggacccc 


tgctcgtgtt 


acaggcgggg 


tttttcttgt 


tgacaagaat 


cctcacaata 


660 


ccacagagtc 


tagactcgtg 


gtggacttct 


ctcaattttc 


tagggggagc 


acccacgtgt 


720 


cctggccaaa 


attcgcagtc 


cccaacctcc 


aatcactcac 


caacctcttg 


tcctccaatt 


780 


tgtcctggct 


atcgctggat 


gtgtctgcgg 


cgttttatca 


tattcctctt 


catcctgctg 


840 


ctatgcctca 


tcttcttgtt 


ggttcttctg 


gactaccaag 


gtatgttgcc 


cgtttgtcct 


900 
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ctacttccag 


gaacatcaac 


caccagcacg 


gggccatgca agacctgcac gattcctgct 


960 


caaggaacct 


ctatgtttcc 


ctcttgttgc 


tgtacaaaac 


cttcggacag 


aaactgcact 


1020 


tgtattccca 


tcccatcatc 


ctgggctttc 


gcaagattcc 


tatgggagtg 


ggcctcagtc 


1080 


cgtttctcct 


ggctcagttt 


actagtgcca 


tttgttcagt 


ggttcgtagg 


gctttccccc 


1140 


actgtttggc 


tttcagttat 


atggatgatg 


tggtattggg 


ggccaagtct 


gtacaacatc 


1200 


ttgagtccct 


ttttacctct 


attaccaatt 


ttcttttgtc 


tttgggtata 


catt 


1254 



<210> 156 

<211> 418 

<212> PRT 

<213> artificial sequence 
<220> 

<223> Proteins corresponding to 61-164 (G145R) + ZZ (serotype d) 
sequence 

<400> 156 

Met Gly Gly Trp Ser Ser Lys Pro Arg Lys Gly Met Gly Thr Asn Leu 
15 10 15 

Ser val Pro Asn Pro Leu Gly Phe Phe Pro Asp His Gin Leu Asp Pro 
20 25 30 

Ala Phe Gly Ala Asn ser Asn Asn Pro Asp Trp Asp Phe Asn Pro Asn 
35 40 45 

Lys Asp Gin Trp Pro Glu Ala Asn Gin Val Gly Ala Gly Gly Arg Ala 
50 55 60 

Gin His Asp Glu Ala val Asp Asn Lys Phe Asn Lys Glu Gin Gin Asn 
65 70 75 80 

Ala Phe Tyr Glu lie Leu His Leu Pro Asn Leu Asn Glu Glu Gin Arg 
85 90 95 

Asn Ala Phe lie Gin Ser Leu Lys Asp Asp Pro Ser Gin Ser Ala Asn 
100 105 110 

Leu Leu Ala Glu Ala Lys Lys Leu Asn Asp Ala Gin Ala Pro Lys Val 
115 120 125 

Asp Asn Lys Phe Asn Lys Glu Gin Gin Asn Ala Phe Tyr Glu lie Leu 
130 135 140 

His Leu Pro Asn Leu Asn Glu Glu Gin Arg Asn Ala Phe lie Gin ser 
145 150 155 160 
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Leu Lys Asp Asp Pro Ser Gin Ser Ala Asn Leu Leu Ala Glu Ala Lys 
165 170 175 

Lys Leu Asn Asp Ala Gin Ala Pro Lys Ala Ala Ala Pro Ala Pro Asn 
180 185 190 

Met Glu Asn Thr Thr Ser Gly Phe Leu Gly Pro Leu Leu Val Leu Gin 
195 200 205 

Ala Gly Phe Phe Leu Leu Thr Arg lie Leu Thr lie Pro Gin Ser Leu 
210 215 " 220 

Asp Ser Trp Trp Thr Ser Leu Asn Phe Leu Gly Gly Ala Pro Thr Cys 
225 230 235 240 

Pro Gly Gin Asn Ser Gin Ser Pro Thr Ser Asn His Ser Pro Thr Ser 
245 250 255 

Cys Pro Pro lie cys Pro Gly Tyr Arg Trp Met Cys Leu Arg Arg Phe 
260 265 270 

lie lie Phe Leu Phe lie Leu Leu Leu Cys Leu lie Phe Leu Leu Val 
275 280 285 

Leu Leu Asp Tyr Gin Gly Met Leu Pro Val Cys Pro Leu Leu Pro Gly 
290 295 300 

Thr ser Thr Thr Ser Thr Gly Pro Cys Lys Thr Cys Thr lie Pro Ala 
305 310 315 320 

Gin Gly Thr Ser Met Phe Pro Ser Cys Cys Cys Thr Lys Pro Ser Asp 
325 330 335 

Arg Asn cys Thr Cys lie Pro lie Pro Ser Ser Trp Ala Phe Ala Arg 
340 345 350 

Phe Leu Trp Glu Trp Ala Ser val Arg Phe Ser Trp Leu Ser Leu Leu 
355 360 " 365 - 

Val Pro Phe Val Gin Trp Phe Val Gly Leu Ser Pro Thr Val Trp Leu 
370 375 J 380 

Ser val lie Trp Met Met Trp Tyr Trp Gly Pro Ser Leu Tyr Asn lie 
385 390 395 400 

Leu Ser Pro Phe Leu Pro Leu Leu Pro lie Phe Phe Cys Leu Trp Val 
405 410 415 
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Tyr lie 

<210> 157 

<211> 1254 

<212> DNA 

<213> artificial sequence 
<220> 

<223> 61-164 (Q129R/G145R) + zz (serotype d) sequence 



<400> 157 
atgggaggtt 


ggtcttccaa 


acctcggaaa 


ggcatgggga 


cgaatctttc 


tgttcccaat 


60 


cctctgggat 


tctttcccga 


tcaccagttg 


gaccctgcgt 


tcggagccaa 


ctcaaacaat 


120 


ccagattggg 


acttcaaccc 


caacaaggat 


caatggccag 


aggcaaatca 


ggtaggagcg 


180 


ggcggccgcg 


cgcaacacga 


tgaagccgta 


gacaacaaat 


tcaacaaaga 


acaacaaaac 


240 


gcgttctatg 


agatcttaca 


tttacctaac 


ttaaacgaag 


aacaacgaaa 


cgccttcatc 


300 


caaagtttaa 


aagatgaccc 


aagccaaagc 


gctaaccttt 


tagcagaagc 


taaaaagcta 


360 


aatgatgctc 


aggcgccgaa 


agtagacaac 


aaattcaaca 


aagaacaaca 


aaacgcgttc 


420 


tatgagatct 


tacatttacc 


taacttaaac 


gaagaacaac 


gaaacgcctt 


catccaaagt 


480 


ttaaaagatg 


acccaagcca 


aagcgctaac 


cttttagcag 


aagctaaaaa 


gctaaatgat 


540 


gctcaggcgc 


cgaaagcggc 


cgcccctgca 


ccgaacatgg 


agaacacaac 


atcaggattc 


600 


ctaggacccc 


tgctcgtgtt 


acaggcgggg 


tttttcttgt 


tgacaagaat 


cctcacaata 


660 


ccacagagtc 


tagactcgtg 


gtggacttct 


ctcaattttc 


tagggggagc 


acccacgtgt 


720 


cctggccaaa 


attcgcagtc 


cccaacctcc 


aatcactcac 


caacctcttg 


tcctccaatt 


780 


tgtcctggct 


atcgctggat 


gtgtctgcgg 


cgttttatca 


tattcctctt 


catcctgctg 


840 


ctatgcctca 


tcttcttgtt 


ggttcttctg 


gactaccaag 


gtatgttgcc 


cgtttgtcct 


900 


ctacttccag 


gaacatcaac 


caccagcacg 


gggccatgca 


agacctgcac 


gattcctgct 


960 


cgaggaacct 


ctatgtttcc 


ctcttgttgc 


tgtacaaaac 


cttcggacag 


aaactgcact 


1020 


tgtattccca 


tcccatcatc 


ctgggctttc 


gcaagattcc 


tatgggagtg 


ggcctcagtc 


1080 


cgtttctcct 


ggctcagttt 


actagtgcca 


tttgttcagt 


ggttcgtagg 


gctttccccc 


1140 


actgtttggc 


tttcagttat 


atggatgatg 


tggtattggg 


ggccaagtct 


gtacaacatc 


1200 


ttgagtccct 


ttttacctct 


attaccaatt 


ttcttttgtc 


tttgggtata 


catt 


1254 



<210> 158 
<211> 418 
<212> PRT 

<213> artificial sequence 
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<220> 

<223> Proteins corresponding to 61-164 (Q129R/G145R) + ZZ (serotype d) 
sequence 

<400> 158 

Met Gly Gly Trp Ser Ser Lys Pro Arg Lys Gly Met Gly Thr Asn Leu 
1 5 10 15 

Ser val Pro Asn Pro Leu Gly Phe Phe Pro Asp His Gin Leu Asp Pro 
20 25 30 

Ala Phe Gly Ala Asn Ser Asn Asn Pro Asp Trp Asp Phe Asn Pro Asn 
35 40 45 

Lys Asp Gin Trp Pro Glu Ala Asn Gin val Gly Ala Gly Gly Arg Ala 
50 55 60 

Gin His Asp Glu Ala Val Asp Asn Lys Phe Asn Lys Glu Gin Gin Asn 
65 70 75 80 

Ala Phe Tyr Glu lie Leu His Leu Pro Asn Leu Asn Glu Glu Gin Arg 
85 90 95 

Asn Ala Phe lie Gin Ser Leu Lys Asp Asp Pro Ser Gin Ser Ala Asn 
100 105 110 

Leu Leu Ala Glu Ala Lys Lys Leu Asn Asp Ala Gin Ala Pro Lys val 
115 120 125 

Asp Asn Lys Phe Asn Lys Glu Gin Gin Asn Ala Phe Tyr Glu lie Leu 
130 135 140 

His Leu Pro Asn Leu Asn Glu Glu Gin Arg Asn Ala Phe lie Gin Ser 
145 150 155 160 

Leu Lys Asp Asp Pro Ser Gin Ser Ala Asn Leu Leu Ala Glu Ala Lys 
165 170 175 

Lys Leu Asn Asp Ala Gin Ala Pro Lys Ala Ala Ala Pro Ala Pro Asn 
180 185 190 

Met Glu Asn Thr Thr Ser Gly Phe Leu Gly Pro Leu Leu val Leu Gin 
195 200 205 

Ala Gly Phe Phe Leu Leu Thr Arg lie Leu Thr lie Pro Gin Ser Leu 
210 215 220 

Asp Ser Trp Trp Thr Ser Leu Asn Phe Leu Gly Gly Ala Pro Thr Cys 
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225 230 235 240 

Pro Gly Gin Asn Ser Gin Ser Pro Thr Ser Asn His Ser Pro Thr Ser 
245 250 255 

Cys Pro Pro lie Cys Pro Gly Tyr Arg Trp Met Cys Leu Arg Arg Phe 
260 265 270 

lie lie Phe Leu Phe lie Leu Leu Leu Cys Leu lie Phe Leu Leu val 
275 280 285 

Leu Leu Asp Tyr Gin Gly Met Leu Pro val Cys Pro Leu Leu Pro Gly 
290 295 300 

Thr Ser Thr Thr ser Thr Gly Pro Cys Lys Thr Cys Thr lie Pro Ala 
305 310 315 320 

Arg Gly Thr ser Met Phe Pro ser Cys Cys Cys Thr Lys Pro Ser Asp 
325 330 335 

Arg Asn cys Thr cys lie Pro lie Pro ser ser Trp Ala Phe Ala Arg 
340 345 350 

Phe Leu Trp Glu Trp Ala Ser Val Arg Phe Ser Trp Leu Ser Leu Leu 
355 360 " 365 

Val Pro Phe Val Gin Trp Phe val Gly Leu Ser Pro Thr val Trp Leu 
370 375 380 

Ser val lie Trp Met Met Trp Tyr Trp Gly Pro Ser Leu Tyr Asn lie 
385 390 395 400 

Leu Ser Pro Phe Leu Pro Leu Leu Pro lie Phe Phe Cys Leu Trp val 
405 410 * 415 

Tyr lie 



<210> 159 

<211> 1053 

<212> DNA 

<213> artificial sequence 
<220> 

<223> 61-164 + EGF (serotype d) sequence 

<400> 159 

atgggaggtt ggtcttccaa acctcggaaa ggcatgggga cgaatctttc tgttcccaat 

cctctgggat tctttcccga tcaccagttg gaccctgcgt tcggagccaa ctcaaacaat 

Page 151 



sequence listing_ST25.txt 

ccagattggg acttcaaccc caacaaggat caatggccag aggcaaatca ggtaggagcg 180 

ggcggccgca tgaactctga ttccgaatgc ccgctgtctc atgacggtta ctgcctgcat 240 

gatggcgtat gcatgtacat cgaagctctg gacaaatacg catgcaactg tgttgtaggt 300 

tacatcggcg aacgttgcca gtatcgcgac ctgaaatggt gggaactgcg taaggcggcc 360 

gcccctgcac cgaacatgga gaacacaaca tcaggattcc taggacccct gctcgtgtta 420 

caggcggggt ttttcttgtt gacaagaatc ctcacaatac cacagagtct agactcgtgg 480 

tggacttctc tcaattttct agggggagca cccacgtgtc ctggccaaaa ttcgcagtcc 540 

ccaacctcca atcactcacc aacctcttgt cctccaattt gtcctggcta tcgctggatg 600 

tgtctgcggc gttttatcat attcctcttc atcctgctgc tatgcctcat cttcttgttg 660 

gttcttctgg actaccaagg tatgttgccc gtttgtcctc tacttccagg aacatcaacc 720 

accagcacgg ggccatgcaa gacctgcacg attcctgctc aaggaacctc tatgtttccc 780 

tcttgttgct gtacaaaacc ttcggacgga aactgcactt gtattcccat cccatcatcc 840 

tgggctttcg caagattcct atgggagtgg gcctcagtcc gtttctcctg gctcagttta 900 

ctagtgccat ttgttcagtg gttcgtaggg ctttccccca ctgtttggct ttcagttata 960 

tggatgatgt ggtattgggg gccaagtctg tacaacatct tgagtccctt tttacctcta 1020 

ttaccaattt tcttttgtct ttgggtatac att 1053 

<210> 160 

<211> 351 

<212> PRT 

<213> artificial sequence 
<220> 

<223> Proteins corresponding to 61-164 + EGF (serotype d) sequence 

<400> 160 

Met Gly Gly Trp Ser ser Lys Pro Arg Lys Gly Met Gly Thr Asn Leu 



Ser val Pro Asn Pro Leu Gly Phe Phe Pro Asp His Gin Leu Asp Pro 
20 25 30 

Ala Phe Gly Ala Asn Ser Asn Asn Pro Asp Trp Asp Phe Asn Pro Asn 
35 40 45 

Lys Asp Gin Trp Pro Glu Ala Asn Gin Val Gly Ala Gly Gly Arg Met 
50 55 60 

Asn ser Asp Ser Glu Cys Pro Leu Ser His Asp Gly Tyr Cys Leu His 
65 70 75 80 
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Asp Gly Val Cys Met Tyr lie Glu Ala Leu Asp Lys Tyr Ala Cys Asn 
85 90 95 

Cys val Val Gly Tyr lie Gly Glu Arg Cys Gin Tyr Arg Asp Leu Lys 
100 105 110 

Trp Trp Glu Leu Arg Lys Ala Ala Ala Pro Ala Pro Asn Met Glu Asn 
115 120 125 

Thr Thr Ser Gly Phe Leu Gly Pro Leu Leu Val Leu Gin Ala Gly Phe 
130 135 140 

Phe Leu Leu Thr Arg lie Leu Thr lie Pro Gin Ser Leu Asp Ser Trp 
145 150 155 160 

Trp Thr ser Leu Asn Phe Leu Gly Gly Ala Pro Thr Cys Pro Gly Gin 
165 170 175 

Asn Ser Gin ser Pro Thr Ser Asn His Ser Pro Thr ser Cys Pro Pro 
180 185 190 

He Cys Pro Gly Tyr Arg Trp Met Cys Leu Arg Arg Phe lie lie Phe 
195 200 205 

Leu Phe lie Leu Leu Leu Cys Leu He Phe Leu Leu val Leu Leu Asp 
210 215 220 

Tyr Gin Gly Met Leu Pro val Cys Pro Leu Leu Pro Gly Thr Ser Thr 
225 230 235 240 

Thr Ser Thr Gly Pro Cys Lys Thr cys Thr lie Pro Ala Gin Gly Thr 
245 250 255 

Ser Met Phe Pro Ser Cys Cys Cys Thr Lys Pro Ser Asp Gly Asn cys 
260 265 270 

Thr Cys lie Pro lie Pro Ser ser Trp Ala Phe Ala Arg Phe Leu Trp 
275 280 285 

Glu Trp Ala ser Val Arg Phe Ser Trp Leu Ser Leu Leu val Pro Phe 
290 295 300 

Val Gin Trp Phe val Gly Leu Ser Pro Thr val Trp Leu Ser Val lie 
305 310 315 320 

Trp Met Met Trp Tyr Trp Gly Pro Ser Leu Tyr Asn lie Leu Ser Pro 
325 330 335 

Page 153 



sequence "listing_ST25.txt 



Phe Leu Pro Leu Leu Pro lie Phe Phe Cys Leu Trp Val Tyr lie 
340 345 350 



<210> 161 

<211> 1053 

<212> DNA 

<213> artificial sequence 
<220> 

<223> 61-164 (Q129R) + EGF (serotype d) sequence 



<400> 161 
atgggaggtt 


ggtcttccaa 


acctcggaaa 


ggcatgggga 


cgaatctttc 


tgttcccaat 


60 


cctctgggat 


tctttcccga 


tcaccagttg 


gaccctgcgt 


tcggagccaa 


ctcaaacaat 


120 


ccagattggg 


acttcaaccc 


caacaaggat 


caatggccag 


aggcaaatca 


ggtaggagcg 


180 


ggcggccgca 


tgaactctga 


ttccgaatgc 


ccgctgtctc 


atgacggtta 


ctgcctgcat 


240 


gatggcgtat 


gcatgtacat 


cgaagctctg 


gacaaatacg 


catgcaactg 


tgttgtaggt 


300 


tacatcggcg 


aacgttgcca 


gtatcgcgac 


ctgaaatggt 


gggaactgcg 


taaggcggcc 


360 


gcccctgcac 


cgaacatgga 


gaacacaaca 


tcaggattcc 


taggacccct 


gctcgtgtta 


420 


caggcggggt 


ttttcttgtt 


gacaagaatc 


ctcacaatac 


cacagagtct 


agactcgtgg 


480 


tggacttctc 


tcaattttct 


agggggagca 


cccacgtgtc 


ctggccaaaa 


ttcgcagtcc 


540 


ccaacctcca 


atcactcacc 


aacctcttgt 


cctccaattt 


gtcctggcta 


tcgctggatg 


600 


tgtctgcggc 


gttttatcat 


attcctcttc 


atcctgctgc 


tatgcctcat 


cttcttgttg 


660 


gttcttctgg 


actaccaagg 


tatgttgccc 


gtttgtcctc 


tacttccagg 


aacatcaacc 


720 


accagcacgg 


ggccatgcaa 


gacctgcacg 


attcctgctc 


gaggaacctc 


tatgtttccc 


780 


tcttgttgct 


gtacaaaacc 


ttcggacgga 


aactgcactt 


gtattcccat 


cccatcatcc 


840 


tgggctttcg 


caagattcct 


atgggagtgg 


gcctcagtcc 


gtttctcctg 


gctcagttta 


900 


ctagtgccat 


ttgttcagtg 


gttcgtaggg 


ctttccccca 


ctgtttggct 


ttcagttata 


960 


tggatgatgt 


ggtattgggg 


gccaagtctg 


tacaacatct 


tgagtccctt 


tttacctcta 


1020 


ttaccaattt 


tcttttgtct 


ttgggtatac 


att 






1053 



<210> 162 

<211> 351 

<212> PRT 

<213> artificial sequence 
<220> 

<223> Proteins corresponding to 61-164 (Q129R) + EGF (serotype d) 
sequence 

<400> 162 
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Met Gly Gly Trp Ser Ser Lys Pro Arg Lys Gly Met Gly Thr Asn Leu 
1 5 10 ' 15 

Ser Val Pro Asn Pro Leu Gly Phe Phe Pro Asp His Gin Leu Asp Pro 
20 25 30 

Ala Phe Gly Ala Asn Ser Asn Asn Pro Asp Trp Asp Phe Asn Pro Asn 
35 40 45 

Lys Asp Gin Trp Pro Glu Ala Asn Gin val Gly Ala Gly Gly Arg Met 
50 55 60 

Asn Ser Asp Ser Glu Cys Pro Leu Ser His Asp Gly Tyr Cys Leu His 
65 70 75 80 

Asp Gly Val Cys Met Tyr lie Glu Ala Leu Asp Lys Tyr Ala Cys Asn 
85 90 95 

Cys val Val Gly Tyr lie Gly Glu Arg Cys Gin Tyr Arg Asp Leu Lys 
100 105 110 

Trp Trp Glu Leu Arg Lys Ala Ala Ala Pro Ala Pro Asn Met Glu Asn 
115 ~ 120 125 

Thr Thr Ser Gly Phe Leu Gly Pro Leu Leu Val Leu Gin Ala Gly Phe 
130 135 140 

Phe Leu Leu Thr Arg lie Leu Thr lie Pro Gin Ser Leu Asp Ser Trp 
145 150 155 160 

Trp Thr ser Leu Asn Phe Leu Gly Gly Ala Pro Thr Cys Pro Gly Gin 
165 170 175 

Asn Ser Gin Ser Pro Thr Ser Asn His Ser Pro Thr ser cys Pro Pro 
180 185 190 

lie Cys Pro Gly Tyr Arg Trp Met Cys Leu Arg Arg Phe lie lie Phe 
195 " 200 " 205 

Leu Phe lie Leu Leu Leu Cys Leu lie Phe Leu Leu val Leu Leu Asp 
210 215 220 

Tyr Gin Gly Met Leu Pro val Cys Pro Leu Leu Pro Gly Thr Ser Thr 
225 230 235 240 

Thr Ser Thr Gly Pro Cys Lys Thr cys Thr lie Pro Ala Arg Gly Thr 
245 250 255 
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Ser Met Phe Pro Ser Cys Cys Cys Thr Lys Pro Ser Asp Gly Asn Cys 

260 265 270 

Thr cys lie Pro lie Pro Ser Ser Trp Ala Phe Ala Arg Phe Leu Trp 
275 280 285 

Glu Trp Ala Ser val Arg Phe ser Trp Leu Ser Leu Leu Val Pro Phe 
290 295 300 

Val Gin Trp Phe Val Gly Leu Ser Pro Thr val Trp Leu Ser val lie 
305 310 315 320 

Trp Met Met Trp Tyr Trp Gly Pro Ser Leu Tyr Asn lie Leu Ser Pro 



Phe Leu Pro Leu Leu Pro lie Phe Phe Cys Leu Trp Val Tyr lie 
340 345 350 

<210> 163 
<211> 1053 
<212> DNA 

<213> artificial sequence 
<220> 

<223> 61-164 (G145R) + EGF (serotype d) sequence 
<400> 163 

atgggaggtt ggtcttccaa acctcggaaa ggcatgggga cgaatctttc tgttcccaat 60 

cctctgggat tctttcccga tcaccagttg gaccctgcgt tcggagccaa ctcaaacaat 120 

ccagattggg acttcaaccc caacaaggat caatggccag aggcaaatca ggtaggagcg 180 

ggcggccgca tgaactctga ttccgaatgc ccgctgtctc atgacggtta ctgcctgcat 240 

gatggcgtat gcatgtacat cgaagctctg gacaaatacg catgcaactg tgttgtaggt 300 

tacatcggcg aacgttgcca gtatcgcgac ctgaaatggt gggaactgcg taaggcggcc 360 

gcccctgcac cgaacatgga gaacacaaca tcaggattcc taggacccct gctcgtgtta 420 

caggcggggt ttttcttgtt gacaagaatc ctcacaatac cacagagtct agactcgtgg 480 

tggacttctc tcaattttct agggggagca cccacgtgtc ctggccaaaa ttcgcagtcc 540 

ccaacctcca atcactcacc aacctcttgt cctccaattt gtcctggcta tcgctggatg 600 

tgtctgcggc gttttatcat attcctcttc atcctgctgc tatgcctcat cttcttgttg 660 

gttcttctgg actaccaagg tatgttgccc gtttgtcctc tacttccagg aacatcaacc 720 

accagcacgg ggccatgcaa gacctgcacg attcctgctc aaggaacctc tatgtttccc 780 

tcttgttgct gtacaaaacc ttcggacaga aactgcactt gtattcccat cccatcatcc 840 

tgggctttcg caagattcct atgggagtgg gcctcagtcc gtttctcctg gctcagttta 900 
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ctagtgccat ttgttcagtg gttcgtaggg ctttccccca ctgtttggct ttcagttata 960 

tggatgatgt ggtattgggg gccaagtctg tacaacatct tgagtccctt tttacctcta 1020 

ttaccaattt tcttttgtct ttgggtatac att 1053 

<210> 164 

<211> 351 

<212> PRT 

<213> artificial sequence 
<220> 

<223> Proteins corresponding to61-164 (G145R) + EGF (serotype d) 
sequence 

<400> 164 

Met Gly Gly Trp ser Ser Lys Pro Arg Lys Gly Met Gly Thr Asn Leu 
15 10 15 

Ser Val Pro Asn Pro Leu Gly Phe Phe Pro Asp His Gin Leu Asp Pro 
20 25 30 

Ala Phe Gly Ala Asn Ser Asn Asn Pro Asp Trp Asp Phe Asn Pro Asn 
35 40 45 

Lys Asp Gin Trp Pro Glu Ala Asn Gin val Gly Ala Gly Gly Arg Met 
50 55 60 

Asn Ser Asp Ser Glu Cys Pro Leu Ser His Asp Gly Tyr cys Leu His 
65 70 75 80 

Asp Gly val Cys Met Tyr lie Glu Ala Leu Asp Lys Tyr Ala Cys Asn 
85 90 95 

Cys val Val Gly Tyr lie Gly Glu Arg Cys Gin Tyr Arg Asp Leu Lys 
100 105 110 

Trp Trp Glu Leu Arg Lys Ala Ala Ala Pro Ala Pro Asn Met Glu Asn 
115 120 125 

Thr Thr Ser Gly Phe Leu Gly Pro Leu Leu Val Leu Gin Ala Gly Phe 
130 135 140 

iPhe Leu Leu Thr Arg lie Leu Thr lie Pro Gin ser Leu Asp Ser Trp 
145 150 155 160 

Trp Thr Ser Leu Asn Phe Leu Gly Gly Ala Pro Thr Cys Pro Gly Gin 
165 170 175 
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Asn ser Gin Ser Pro Thr ser Asn His Ser Pro fnr ser Cys Pro Pro 
180 185 190 



lie Cys Pro Gly Tyr Arg Trp Met Cys Leu Arg Arg Phe lie lie Phe 
195 200 205 

Leu Phe lie Leu Leu Leu Cys Leu lie Phe Leu Leu Val Leu Leu Asp 
210 215 220 

Tyr Gin Gly Met Leu Pro Val Cys Pro Leu Leu Pro Gly Thr Ser Thr 
225 230 235 240 

Thr Ser Thr Gly Pro Cys Lys Thr Cys Thr lie Pro Ala Gin Gly Thr 
245 250 255 

Ser Met Phe Pro Ser Cys Cys Cys Thr Lys Pro Ser Asp Arg Asn Cys 
260 265 270 

Thr Cys lie Pro lie Pro Ser Ser Trp Ala Phe Ala Arg Phe Leu Trp 
275 280 285 

Glu Trp Ala Ser Val Arg Phe ser Trp Leu Ser Leu Leu val Pro Phe 
290 295 300 

val Gin Trp Phe val Gly Leu Ser Pro Thr val Trp Leu Ser val lie 
305 310 315 320 

Trp Met Met Trp Tyr Trp Gly Pro Ser Leu Tyr Asn lie Leu Ser Pro 
325 330 335 

Phe Leu Pro Leu Leu Pro lie Phe Phe Cys Leu Trp Val Tyr lie 

345 350 





340 


<210> 


165 


<:211> 


1053 


<:212> 


DNA 


<213> 


artificial sequence 


<220> 




<223> 


61-164 (Q129R/G145R) 


<400> 


165 



atgggaggtt ggtcttccaa acctcggaaa ggcatgggga cgaatctttc tgttcccaat 60 

cctctgggat tctttcccga tcaccagttg gaccctgcgt tcggagccaa ctcaaacaat 120 

ccagattggg acttcaaccc caacaaggat caatggccag aggcaaatca ggtaggagcg 180 

ggcggccgca tgaactctga ttccgaatgc ccgctgtctc atgacggtta ctgcctgcat 240 

gatggcgtat gcatgtacat cgaagctctg gacaaatacg catgcaactg tgttgtaggt 300 
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tacatcggcg 


aacgttgcca 


gtatcgcgac 


ctgaaatggt gggaactgcg taaggcggcc 


360 


gcccctgcac 


cgaacatgga 


gaacacaaca 


tcaggattcc taggacccct gctcgtgtta 


420 


caggcggggt 


ttttcttgtt 


gacaagaatc 


ctcacaatac 


cacagagtct 


agactcgtgg 


480 


tggacttctc 


tcaattttct 


agggggagca 


cccacgtgtc 


ctggccaaaa 


ttcgcagtcc 


540 


ccaacctcca 


atcactcacc 


aacctcttgt 


cctccaattt 


gtcctggcta 


tcgctggatg 


600 


tgtctgcggc 


gttttatcat 


attcctcttc 


atcctgctgc 


tatgcctcat 


cttcttgttg 


660 


gttcttctgg 


actaccaagg 


tatgttgccc 


gtttgtcctc tacttccagg aacatcaacc 


720 


accagcacgg 


ggccatgcaa 


gacctgcacg 


attcctgctc 


gaggaacctc 


tatgtttccc 


780 


tcttgttgct 


gtacaaaacc 


ttcggacaga 


aactgcactt 


gtattcccat 


cccatcatcc 


840 


tgggctttcg 


caagattcct 


atgggagtgg 


gcctcagtcc 


gtttctcctg 


gctcagttta 


900 


ctagtgccat 


ttgttcagtg 


gttcgtaggg 


ctttccccca 


ctgtttggct 


ttcagttata 


960 


tggatgatgt 


ggtattgggg 


gccaagtctg 


tacaacatct 


tgagtccctt 


tttacctcta 


1020 


ttaccaattt 


tcttttgtct 


ttgggtatac 


att 






1053 



<210> 166 

<211> 351 

<212> PRT 

<213> artificial sequence 
<220> 

<223> Proteins corresponding to 61-164 (Q129R/G145R) + EGF (serotype d) 
sequence 

<400> 166 

Met Gly Gly Trp ser Ser Lys Pro Arg Lys Gly Met Gly Thr Asn Leu 
1 5 10 15 

Ser val Pro Asn Pro Leu Gly Phe Phe Pro Asp His Gin Leu Asp Pro 
20 25 30 

Ala Phe Gly Ala Asn Ser Asn Asn Pro Asp Trp Asp Phe Asn Pro Asn 
35 40 45 

Lys Asp Gin Trp Pro Glu Ala Asn Gin val Gly Ala Gly Gly Arg Met 
50 55 60 

Asn Ser Asp Ser Glu Cys Pro Leu Ser His Asp Gly Tyr Cys Leu His 
65 70 75 80 

Asp Gly Val Cys Met Tyr lie Glu Ala Leu Asp Lys Tyr Ala Cys Asn 
85 90 95 



Cys val val Gly Tyr lie Gly Glu Arg Cys Gin Tyr Arg Asp Leu Lys 
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100 105 110 

Trp Trp Glu Leu Arg Lys Ala Ala Ala Pro Ala Pro Asn Met Glu Asn 
115 120 125 

Thr Thr Ser Gly Phe Leu Gly Pro Leu Leu Val Leu Gin Ala Gly Phe 
130 " 135 140 

Phe Leu Leu Thr Arg lie Leu Thr lie Pro Gin Ser Leu Asp Ser Trp 
145 150 155 160 

Trp Thr ser Leu Asn Phe Leu Gly Gly Ala Pro Thr Cys Pro Gly Gin 
165 ' 170 175 

Asn ser Gin ser Pro Thr Ser Asn His Ser Pro Thr ser cys Pro Pro 
180 185 190 

lie Cys Pro Gly Tyr Arg Trp Met Cys Leu Arg Arg Phe lie lie Phe 
195 ~ 200 205 

Leu Phe lie Leu Leu Leu Cys Leu lie Phe Leu Leu Val Leu Leu Asp 
210 215 220 

Tyr Gin Gly Met Leu Pro val Cys Pro Leu Leu Pro Gly Thr Ser Thr 
225 230 235 240 

Thr Ser Thr Gly Pro Cys Lys Thr cys Thr lie Pro Ala Arg Gly Thr 
245 250 255 

Ser Met Phe Pro Ser Cys Cys Cys Thr Lys Pro Ser Asp Arg Asn Cys 
260 265 270 

Thr Cys lie Pro lie Pro Ser Ser Trp Ala Phe Ala Arg Phe Leu Trp 
275 280 285 

Glu Trp Ala Ser val Arg Phe Ser Trp Leu Ser Leu Leu Val Pro Phe 
290 295 300 

val Gin Trp Phe val Gly Leu Ser Pro Thr val Trp Leu ser val lie 
305 310 315 320 

Trp Met Met Trp Tyr Trp Gly Pro Ser Leu Tyr Asn lie Leu Ser Pro 
325 330 335 

Phe Leu Pro Leu Leu Pro lie Phe Phe Cys Leu Trp Val Tyr lie 
340 345 350 
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<210> 167 

<211> 912 

<212> DNA 

<213> artificial sequence 
<220> 

<223> 61-164 + STR-tag (serotype d) sequence 



<400> 167 
atgggaggtt 


ggtcttccaa 


acctcggaaa 


ggcatgggga 


cgaatctttc 


tgttcccaat 


60 


cctctgggat 


tctttcccga 


tcaccagttg 


gaccctgcgt 


tcggagccaa 


ctcaaacaat 


120 


ccagattggg 


acttcaaccc 


caacaaggat 


caatggccag 


aggcaaatca 


ggtaggagcg 


180 


ggcggccgct 


ggagccaccc 


gcagttcgaa 


aaagcggccg 


cccctgcacc 


gaacatggag 


240 


aacacaacat 


caggattcct 


aggacccctg 


ctcgtgttac 


aggcggggtt 


tttcttgttg 


300 


acaagaatcc 


tcacaatacc 


acagagtcta 


gactcgtggt 


ggacttctct 


caattttcta 


360 


gggggagcac 


ccacgtgtcc 


tggccaaaat 


tcgcagtccc 


caacctccaa 


tcactcacca 


420 


acctcttgtc 


ctccaatttg 


tcctggctat 


cgctggatgt 


gtctgcggcg 


ttttatcata 


480 


ttcctcttca 


tcctgctgct 


atgcctcatc 


ttcttgttgg 


ttcttctgga 


ctaccaaggt 


540 


atgttgcccg 


tttgtcctct 


acttccagga 


acatcaacca 


ccagcacggg 


gccatgcaag 


600 


acctgcacga 


ttcctgctca 


aggaacctct 


atgtttccct 


cttgttgctg 


tacaaaacct 


660 


tcggacggaa 


actgcacttg 


tattcccatc 


ccatcatcct 


gggctttcgc 


aagattccta 


720 


tgggagtggg 


cctcagtccg 


tttctcctgg 


ctcagtttac 


tagtgccatt 


tgttcagtgg 


780 


ttcgtagggc 


tttcccccac 


tgtttggctt 


tcagttatat 


ggatgatgtg 


gtattggggg 


840 


ccaagtctgt 


acaacatctt 


gagtcccttt 


ttacctctat 


taccaatttt 


cttttgtctt 


900 


tgggtataca 


tt 










912 



<210> 168 

<211> 304 

<212> PRT 

<213> artificial sequence 
<220> 

<223> Proteins corresponding to 61-164 + STR-tag (serotype d) sequence 

<400> 168 

Met Gly Gly Trp Ser ser Lys Pro Arg Lys Gly Met Gly Thr Asn Leu 
1 5 10 15 



Ser val Pro Asn Pro Leu Gly Phe Phe Pro Asp His Gin Leu Asp Pro 
20 25 30 



Ala Phe Gly Ala Asn Ser Asn Asn Pro Asp Trp Asp Phe Asn Pro Asn 
35 40 45 
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Lys Asp Gin Trp Pro Glu Ala Asn Gin Val Gly Ala Gly Gly Arg Trp 
50 55 60 

Ser His Pro Gin Phe Glu Lys Ala Ala Ala Pro Ala Pro Asn Met Glu 
65 70 75 80 

Asn Thr Thr Ser Gly Phe Leu Gly Pro Leu Leu val Leu Gin Ala Gly 
85 90 95 

Phe Phe Leu Leu Thr Arg lie Leu Thr lie Pro Gin Ser Leu Asp Ser 
100 105 110 

Trp Trp Thr Ser Leu Asn Phe Leu Gly Gly Ala Pro Thr Cys Pro Gly 
115 120 125 

Gin Asn ser Gin Ser Pro Thr ser Asn His Ser Pro Thr Ser Cys Pro 
130 135 140 

Pro lie Cys Pro Gly Tyr Arg Trp Met Cys Leu Arg Arg Phe lie lie 
145 150 155 ^ 160 

Phe Leu Phe lie Leu Leu Leu Cys Leu lie Phe Leu Leu Val Leu Leu 
165 170 175 

Asp Tyr Gin Gly Met Leu Pro val Cys Pro Leu Leu Pro Gly Thr Ser 
180 185 190 

Thr Thr Ser Thr Gly Pro Cys Lys Thr Cys Thr lie Pro Ala Gin Gly 
195 200 205 

Thr Ser Met Phe Pro Ser Cys Cys Cys Thr Lys Pro Ser Asp Gly Asn 
210 215 220 

Cys Thr Cys lie Pro lie Pro Ser Ser Trp Ala Phe Ala Arg Phe Leu 
225 230 235 240 

Trp Glu Trp Ala Ser Val Arg Phe Ser Trp Leu Ser Leu Leu val Pro 
245 ~ 250 255 

Phe val Gin Trp Phe val Gly Leu ser Pro Thr val Trp Leu Ser val 
260 265 270 

lie Trp Met Met Trp Tyr Trp Gly Pro Ser Leu Tyr Asn He Leu Ser 
275 280 285 

Pro Phe Leu Pro Leu Leu Pro lie Phe Phe Cys Leu Trp val Tyr lie 
290 295 300 
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<210> 169 
<211> 912 
<212> DNA 

<213> artificial sequence 



<220> 

<223> 61-164 (Q129R) 


+ STR-tag < 


[serotype d) 


> sequence 






<400> 169 
axgggaggtt 


ggtc l uccaa 


acctcggaaa 


ggcatgggga 


cgaatctttc 


tgttcccaat 


60 


cc lc tggga l 


re l t rcccga 


tcaccagttg 


gaccctgcgt 


tcggagccaa 


ctcaaacaat 


120 


ccaga l tgyg 


ac u rcaaccc 


caacaaggat 


caatggccag 


aggcaaatca 


ggtaggagcg 


180 


ggcggccgci 


ggagccaccc 


gcagttcgaa 


aaagcggccg 


cccctgcacc 


gaacatggag 


240 


ddCdCddCdL 


cagga l lcc l 


aggacccctg 


ctcgtgttac 


aggcggggtt 


tttcttgttg 


300 


acaagaatcc 


tcacaatacc 


acagagtcta 


gactcgtggt 


ggacttctct 


caattttcta 


360 


gggggagcac 


ccacgtgtcc 


tggccaaaat 


tcgcagtccc 


caacctccaa 


tcactcacca 


420 


acctcttgtc 


ctccaatttg 


tcctggctat 


cgctggatgt 


gtctgcggcg 


ttttatcata 


480 


ttcctcttca 


tcctgctgct 


atnrrtratr 


l-1"rl"l"nl"l-nn 


l iLt l y y d 






atgttgcccg 


tttgtcctct 


acttccagga 


acatcaacca 


ccagcacggg 


gccatgcaag 


600 


acctgcacga 


ttcctgctcg 


aggaacctct 


atgtttccct 


cttgttgctg 


tacaaaacct 


660 


tcggacggaa 


actgcacttg 


tattcccatc 


ccatcatcct 


gggctttcgc 


aagattccta 


720 


tgggagtggg 


cctcagtccg 


tttctcctgg 


ctcagtttac 


tagtgccatt 


tgttcagtgg 


780 


ttcgtagggc 


tttcccccac 


tgtttggctt 


tcagttatat 


ggatgatgtg 


gtattggggg 


840 


ccaagtctgt 


acaacatctt 


gagtcccttt 


ttacctctat 


taccaatttt 


cttttgtctt 


900 


tgggtataca 


tt 










912 



<210> 170 

<211> 304 

<212> PRT 

<213> artificial sequence 
<220> 

<223> Proteins corresponding to 61-164 (Q129R) + STR-tag (serotype d) 
sequence 

<400> 170 

Met Gly Gly Trp Ser ser Lys Pro Arg Lys Gly Met Gly Thr Asn Leu 
15 10 15 

Ser Val Pro Asn Pro Leu Gly Phe Phe Pro Asp His Gin Leu Asp Pro 
20 25 30 
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Ala Phe Gly Ala Asn Ser Asn Asn Pro Asp Trp Asp Phe Asn Pro Asn 
35 40 45 

Lys Asp Gin Trp Pro Glu Ala Asn Gin val Gly Ala Gly Gly Arg Trp 
50 55 60 

Ser His Pro Gin Phe Glu Lys Ala Ala Ala Pro Ala Pro Asn Met Glu 
65 70 75 80 

Asn Thr Thr Ser Gly Phe Leu Gly Pro Leu Leu Val Leu Gin Ala Gly 
85 90 95 

Phe Phe Leu Leu Thr Arg lie Leu Thr lie Pro Gin Ser Leu Asp Ser 
100 105 110 

Trp Trp Thr Ser Leu Asn Phe Leu Gly Gly Ala Pro Thr Cys Pro Gly 
115 120 125 

Gin Asn ser Gin ser Pro Thr Ser Asn His Ser Pro Thr Ser Cys Pro 
130 135 140 

Pro lie Cys Pro Gly Tyr Arg Trp Met Cys Leu Arg Arg Phe lie lie 
145 150 155 160 

Phe Leu Phe lie Leu Leu Leu Cys Leu lie Phe Leu Leu Val Leu Leu 
165 170 175 

Asp Tyr Gin Gly Met Leu Pro Val Cys Pro Leu Leu Pro Gly Thr Ser 
180 185 190 

Thr Thr Ser Thr Gly Pro Cys Lys Thr Cys Thr lie Pro Ala Arg Gly 
195 200 205 

Thr Ser Met Phe Pro Ser Cys Cys Cys Thr Lys Pro Ser Asp Gly Asn 
210 215 220 

Cys Thr Cys lie Pro lie Pro Ser Ser Trp Ala Phe Ala Arg Phe Leu 
225 230 235 240 

Trp Glu Trp Ala Ser val Arg Phe Ser Trp Leu Ser Leu Leu val Pro 
245 250 255 

Phe val Gin Trp Phe val Gly Leu ser Pro Thr val Trp Leu Ser val 
260 265 270 

lie Trp Met Met Trp Tyr Trp Gly Pro ser Leu Tyr Asn lie Leu Ser 
275 280 285 
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Pro Phe Leu Pro Leu Leu Pro lie Phe Phe Cys Leu Trp val Tyr lie 
290 295 300 

<210> 171 

<211> 912 

<212> DNA 

<213> artificial sequence 



<220> 

<223> 61-164 (G145R) 


+ STR-tag i 


(serotype d} 


) sequence 






<fdc\n^ i7i 
atgggaggtt 


ggtcttccaa 


acctcggaaa 


ggcatgggga 


cgaatctttc 


tgttcccaat 


60 


cctctgggat 


tctttcccga 


tcaccagttg 


gaccctgcgt 


tcggagccaa 


ctcaaacaat 


120 


ccagattggg 


acttcaaccc 


caacaaggat 


caatggccag 


aggcaaatca 


ggtaggagcg 


180 


ggcggccgct 


ggagccaccc 


gcagttcgaa 


aaagcggccg 


cccctgcacc 


gaacatggag 


240 


aacacaacat 


caggattcct 


aggacccctg 


ctcgtgttac 


aggcggggtt 


tttcttgttg 


300 


acaagaatcc 


tcacaatacc 


acagagtcta 


gactcgtggt 


ggacttctct 


caattttcta 


360 


gggggagcac 


ccacgtgtcc 


tggccaaaat 


tcgcagtccc 


caacctccaa 


tcactcacca 


420 


acctcttgtc 


ctccaatttg 


tcctggctat 


cgctggatgt 


gtctgcggcg 


ttttatcata 


480 


ttcctcttca 


tcctgctgct 


atgcctcatc 


ttcttgttgg 


ttcttctgga 


ctaccaaggt 


540 


atgttgcccg 


tttgtcctct 


acttccagga 


acatcaacca 


ccagcacggg 


gccatgcaag 


600 


acctgcacga 


ttcctgctca 


aggaacctct 


atgtttccct 


cttgttgctg 


tacaaaacct 


660 


tcggacagaa 


actgcacttg 


tattcccatc 


ccatcatcct 


gggctttcgc 


aagattccta 


720 


tgggagtggg 


cctcagtccg 


tttctcctgg 


ctcagtttac 


tagtgccatt 


tgttcagtgg 


780 


ttcgtagggc 


tttcccccac 


tgtttggctt 


tcagttatat 


ggatgatgtg 


gtattggggg 


840 


ccaagtctgt 


acaacatctt 


gagtcccttt 


ttacctctat 


taccaatttt 


cttttgtctt 


900 


tgggtataca 


tt 










912 



<210> 172 

<211> 304 

<212> PRT 

<213> artificial sequence 
<220> 

<223> Proteins corresponding to 61-164 (G145R) + STR-tag (serotype d) 
sequence 

<400> 172 

Met Gly Gly Trp Ser ser Lys Pro Arg Lys Gly Met Gly Thr Asn Leu 
1 5 10 15 



Ser Val Pro Asn Pro Leu Gly Phe Phe Pro Asp His Gin Leu Asp Pro 
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20 25 ~ 30 

Ala Phe Gly Ala Asn Ser Asn Asn Pro Asp Trp Asp Phe Asn Pro Asn 
35 40 45 

Lys Asp Gin Trp Pro Glu Ala Asn Gin val Gly Ala Gly Gly Arg Trp 
50 55 60 

Ser His Pro Gin Phe Glu Lys Ala Ala Ala Pro Ala Pro Asn Met Glu 

65 70 75 80 

Asn Thr Thr Ser Gly Phe Leu Gly Pro Leu Leu val Leu Gin Ala Gly 
85 90 95 

Phe Phe Leu Leu Thr Arg lie Leu Thr lie Pro Gin Ser Leu Asp Ser 
100 105 110 

Trp Trp Thr ser Leu Asn Phe Leu Gly Gly Ala Pro Thr Cys Pro Gly 
115 120 ' 125 

Gin Asn ser Gin Ser Pro Thr Ser Asn His Ser Pro Thr Ser Cys Pro 
130 135 140 



Pro lie Cys Pro Gly Tyr Arg Trp Met Cys Leu Arg Arg Phe lie lie 
145 150 " 155 160 

phe Leu Phe lie Leu Leu Leu Cys Leu lie Phe Leu Leu val Leu Leu 
165 170 175 

Asp Tyr Gin Gly Met Leu Pro val Cys Pro Leu Leu Pro Gly Thr ser 
180 185 190 

Thr Thr Ser Thr Gly Pro Cys Lys Thr cys Thr lie Pro Ala Gin Gly 
195 200 205 

Thr Ser Met Phe Pro Ser Cys Cys Cys Thr Lys Pro Ser Asp Arg Asn 
210 215 220 

Cys Thr Cys lie Pro lie Pro Ser ser Trp Ala Phe Ala Arg Phe Leu 
225 ' 230 235 ~ 240 

Trp Glu Trp Ala Ser Val Arg Phe Ser Trp Leu Ser Leu Leu val Pro 
245 " 250 255 

Phe val Gin Trp Phe val Gly Leu Ser Pro Thr val Trp Leu Ser val 
260 265 270 
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lie Trp Met Met Trp Tyr Trp Gly Pro Ser Leu Tyr Asn lie Leu Ser 
275 280 285 



Pro Phe Leu Pro Leu Leu Pro lie Phe Phe Cys Leu Trp val Tyr lie 
290 295 300 



<210> 173 

<211> 912 

<212> DNA 

<213> artificial sequence 
<220> 

<223> 61-164 (Q129R/G145R) + STR-tag (serotype d) sequence 



<400> 173 

atgggaggtt ggtcttccaa 


acctcggaaa 


ggcatgggga 


cgaatctttc 


tgttcccaat 


60 


cctctgggat tctttcccga 


tcaccagttg 


gaccctgcgt 


tcggagccaa 


ctcaaacaat 


120 


ccagattggg acttcaaccc 


caacaaggat 


caatggccag 


aggcaaatca 


ggtaggagcg 


180 


ggcggccgct ggagccaccc 


gcagttcgaa 


aaagcggccg 


cccctgcacc 


gaacatggag 


240 


aacacaacat caggattcct 


aggacccctg 


ctcgtgttac 


aggcggggtt 


tttcttgttg 


300 


acaagaatcc tcacaatacc 


acagagtcta 


gactcgtggt 


ggacttctct 


caattttcta 


360 


gggggagcac ccacgtgtcc 


tggccaaaat 


tcgcagtccc 


caacctccaa 


tcactcacca 


420 


acctcttgtc ctccaatttg 


tcctggctat 


cgctggatgt 


gtctgcggcg 


ttttatcata 


480 


ttcctcttca tcctgctgct 


atgcctcatc 


ttcttgttgg 


ttcttctgga 


ctaccaaggt 


540 


atgttgcccg tttgtcctct 


acttccagga 


acatcaacca 


ccagcacggg 


gccatgcaag 


600 


acctgcacga ttcctgctcg 


aggaacctct 


atgtttccct 


cttgttgctg 


tacaaaacct 


660 


tcggacagaa actgcacttg 


tattcccatc 


ccatcatcct 


gggctttcgc 


aagattccta 


720 


tgggagtggg cctcagtccg 


tttctcctgg 


ctcagtttac 


tagtgccatt 


tgttcagtgg 


780 


ttcgtagggc tttcccccac 


tgtttggctt 


tcagttatat 


ggatgatgtg 


gtattggggg 


840 


ccaagtctgt acaacatctt 


gagtcccttt 


ttacctctat 


taccaatttt 


cttttgtctt 


900 


tgggtataca tt 










912 



<:210> 174 

<:211> 304 

<212> PRT 

<213> artificial sequence 
<220> 

<223> Proteins corresponding to 61-164 (Q129R/G145R) + STR-tag 
(serotype d) sequence 

<400> 174 

Met Gly Gly Trp ser Ser Lys Pro Arg Lys Gly Met Gly Thr Asn Leu 
15 10 15 
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Ser Val Pro Asn Pro Leu Gly Phe Phe Pro Asp His Gin Leu Asp Pro 
20 25 30 

Ala Phe Gly Ala Asn Ser Asn Asn Pro Asp Trp Asp Phe Asn Pro Asn 
35 40 45 

Lys Asp Gin Trp Pro Glu Ala Asn Gin val Gly Ala Gly Gly Arg Trp 
50 55 60 

Ser His Pro Gin Phe Glu Lys Ala Ala Ala Pro Ala Pro Asn Met Glu 
65 70 75 80 

Asn Thr Thr Ser Gly Phe Leu Gly Pro Leu Leu Val Leu Gin Ala Gly 
85 90 95 

Phe Phe Leu Leu Thr Arg lie Leu Thr lie Pro Gin Ser Leu Asp Ser 
100 105 110 

Trp Trp Thr Ser Leu Asn Phe Leu Gly Gly Ala Pro Thr Cys Pro Gly 
115 120 125 

Gin Asn Ser Gin Ser Pro Thr ser Asn His Ser Pro Thr ser Cys Pro 
130 135 140 

Pro lie Cys Pro Gly Tyr Arg Trp Met Cys Leu Arg Arg Phe lie lie 
145 150 155 " 160 

Phe Leu Phe lie Leu Leu Leu Cys Leu lie Phe Leu Leu val Leu Leu 
165 ~ 170 175 

Asp Tyr Gin Gly Met Leu Pro Val Cys Pro Leu Leu Pro Gly Thr ser 
180 185 190 

Thr Thr ser Thr Gly Pro Cys Lys Thr cys Thr lie Pro Ala Arg Gly 
195 200 205 

Thr Ser Met Phe Pro Ser Cys Cys Cys Thr Lys Pro Ser Asp Arg Asn 
210 215 220 

Cys Thr Cys lie Pro lie Pro Ser Ser Trp Ala Phe Ala Arg Phe Leu 
225 230 235 240 

Trp Glu Trp Ala Ser val Arg Phe Ser Trp Leu Ser Leu Leu Val Pro 
245 250 255 

Phe val Gin Trp Phe val Gly Leu Ser Pro Thr val Trp Leu Ser val 
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265 270 



lie Trp Met Met Trp Tyr Trp Gly Pro Ser Leu Tyr Asn lie Leu Ser 
275 280 285 

pro Phe Leu Pro Leu Leu Pro lie Phe Phe Cys Leu Trp Val Tyr lie 
290 295 300 

<210> 175 

<211> 1128 

<212> DNA 

<213> artificial sequence 
<220> 

<223> HSVl-tk (herpes simplex virus type-1 thymidine kinase) sequence-1 

<400> 175 



atggcctcgt 


accccggcca 


tcaacacgcg 


tctgcgttcg 


accaggctgc 


gcgttctcgc 


60 


ggccatagca 


accgacgtac 


ggcgttgcgc 


cctcgccggc 


agcaagaagc 


cacggaagtc 


120 


cgcccggagc 


agaaaatgcc 


cacgctactg 


cgggtttata 


tagacggtcc 


ccacgggatg 


180 


gggaaaacca 


ccaccacgca 


actgctggtg 


gccctgggtt 


cgcgcgacga 


tatcgtctac 


240 


gtacccgagc 


cgatgactta 


ctggcgggtg 


ctgggggctt 


ccgagacaat 


cgcgaacatc 


300 


tacaccacac 


aacaccgcct 


cgaccagggt 


gagatatcgg 


ccggggacgc 


ggcggtggta 


360 


atgacaagcg 


cccagataac 


aatgggcatg 


ccttatgccg 


tgaccgacgc 


cgttctggct 


420 


cctcatatcg 


ggggggaggc 


tgggagctca 


catgccccgc 


ccccggccct 


caccctcatc 


480 


ttcgaccgcc 


atcccatcgc 


cgccctcctg 


tgctacccgg 


ccgcgcggta 


ccttatgggc 


540 


agcatgaccc 


cccaggccgt 


gctggcgttc 


gtggccctca 


tcccgccgac 


cttgcccggc 


600 


accaacatcg 


tgcttggggc 


ccttccggag 


gacagacaca 


tcgaccgcct 


ggccaaacgc 


660 


cagcgccccg 


gcgagcggct 


ggacctggct 


atgctggctg 


cgattcgccg 


cgtttacggg 


720 


ctacttgcca 


atacggtgcg 


gtatctgcag 


tgcggcgggt 


cgtggcggga 


ggactgggga 


780 


cagctttcgg 


ggacggccgt 


gccgccccag 


ggtgccgagc 


cccagagcaa 


cgcgggccca 


840 


cgaccccata 


tcggggacac 


gttatttacc 


ctgtttcggg 


cccccgagtt 


gctggccccc 


900 


aacggcgacc 


tgtataacgt 


gtttgcctgg 


gccttggacg 


tcttggccaa 


acgcctccgt 


960 


tccatgcacg 


tctttatcct 


ggattacgac 


caatcgcccg 


ccggctgccg 


ggacgccctg 


1020 


ctgcaactta 


cctccgggat 


ggtccagacc 


cacgtcacca 


cccccggctc 


cataccgacg 


1080 


atatgcgacc 


tggcgcgcac 


gtttgcccgg 


gagatggggg 


aggctaac 




1128 



<210> 176 
<211> 1128 
<212> PRT 

<213> artificial sequence 
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<220> 

<223> HSVl-tk (herpes simplex virus type-1 thymidine kinase) sequence-2 
<400> 176 

Ala Thr Gly Gly Cys Thr Thr Cys Gly Thr Ala Cys Cys Cys Cys Thr 
1 5 10 15 

Gly Cys Cys Ala Thr cys Ala Ala cys Ala cys Gly cys Gly Thr cys 
20 25 30 

Thr Gly Cys Gly Thr Thr Cys Gly Ala Cys Cys Ala Gly Gly Cys Thr 
35 40 45 

Gly Cys Gly Cys Gly Thr Thr Cys Thr Cys Gly Cys Gly Gly Cys Cys 
50 ' 55 60 

Ala Thr Ala Gly Cys Ala Ala Cys Cys Gly Ala Cys Gly Thr Ala Cys 
65 70 75 80 

Gly Gly Cys Gly Thr Thr Gly Cys Gly Cys Cys Cys Thr Cys Gly Cys 
85 90 95 

Cys Gly Gly Cys Ala Gly Cys Ala Ala Gly Ala Ala Gly Cys Cys Ala 
100 105 110 

Cys Gly Gly Ala Ala Gly Thr cys Cys Gly Cys Cys Thr Gly Gly Ala 
115 120 125 

Gly Cys Ala Gly Ala Ala Ala Ala Thr Gly Cys Cys Cys Ala Cys Gly 
130 * 135 140 

Cys Thr Ala Cys Thr Gly Cys Gly Gly Gly Thr Thr Thr Ala Thr Ala 
145 150 155 160 

Thr Ala Gly Ala cys Gly Gly Thr cys Cys Thr Cys Ala Cys Gly Gly 
165 170 175 

Gly Ala Thr Gly Gly Gly Gly Ala Ala Ala Ala Cys Cys Ala Cys Cys 
180 185 190 

Ala Cys Cys Ala Cys Gly Cys Ala Ala Cys Thr Gly Cys Thr Gly Gly 
195 200 205 

Thr Gly Gly Cys Cys Cys Thr Gly Gly Gly Thr Thr Cys Gly Cys Gly 
210 215 220 

Cys Gly Ala Cys Gly Ala Thr Ala Thr Cys Gly Thr Cys Thr Ala Cys 
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225 230 235 240 

Gly Thr Ala Cys Cys Cys Gly Ala Gly Cys Cys Gly Ala Thr Gly Ala 
245 250 255 

Cys Thr Thr Ala Cys Thr Gly Gly cys Ala Gly Gly Thr Gly Cys Thr 
260 265 270 

Gly Gly Gly Gly Gly Cys Thr Thr Cys Cys Gly Ala Gly Ala Cys Ala 
275 280 285 

Ala Thr Cys Gly Cys Gly Ala Ala Cys Ala Thr Cys Thr Ala Cys Ala 
290 295 300 

Cys cys Ala Cys Ala Cys Ala Ala Cys Ala Cys Cys Gly Cys Cys Thr 
305 310 315 320 

cys Gly Ala Cys Cys Ala Gly Gly Gly Thr Gly Ala Gly Ala Thr Ala 
325 330 * 335 

Thr Cys Gly Gly Cys Cys Gly Gly Gly Gly Ala Cys Gly Cys Gly Gly 
340 345 350 

Cys Gly Gly Thr Gly Gly Thr Ala Ala Thr Gly Ala cys Ala Ala Gly 
355 360 365 

Cys Gly Cys Cys Cys Ala Gly Ala Thr Ala Ala Cys Ala Ala Thr Gly 
370 375 380 

Gly Gly Cys Ala Thr Gly Cys Cys Thr Thr Ala Thr Gly Cys Cys Gly 
385 390 395 400 

Thr Gly Ala Cys Cys Gly Ala Cys Gly Cys Cys Gly Thr Thr cys Thr 
405 410 415 

Gly Gly Cys Thr Cys Cys Thr Cys Ala Thr Gly Thr Cys Gly Gly Gly 
420 425 430 

Gly Gly Gly Gly Ala Gly Gly Cys Thr Gly Gly Gly Ala Gly Thr Thr 
435 " 440 445 

Cys Ala Cys Ala Thr Gly Cys Cys Cys Cys Gly Cys Cys Cys Cys Cys 
450 455 460 

Gly Gly Cys Cys Cys Thr Cys Ala Cys Cys Cys Thr Cys Ala Thr cys 
465 470 475 480 
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Thr Thr Cys Gly Ala Cys Cys Gly Cys Cys Ala fnr Cys Cys Cys Ala 
485 490 495 

Thr cys Gly Cys Cys Gly Cys Cys Cys Thr Cys Cys Thr Gly Thr Gly 
500 505 510 

Cys Thr Ala Cys Cys Cys Gly Gly Cys Cys Gly Cys Gly Cys Gly Ala 
515 520 525 

Thr Ala Cys Cys Thr Thr Ala Thr Gly Gly Gly Cys Ala Gly Cys Ala 
530 535 540 

Thr Gly Ala Cys Cys Cys Cys Cys Cys Ala Gly Gly Cys Cys Gly Thr 
545 550 555 560 

Gly Cys Thr Gly Gly Cys Gly Thr Thr Cys Gly Thr Gly Gly Cys Cys 
565 * 570 575 

Cys Thr Cys Ala Thr Cys Cys Cys Gly Cys Cys Gly Ala Cys Cys Thr 
580 585 590 

Thr Gly Cys Cys Cys Gly Gly Cys Ala Cys Ala Ala Ala Cys Ala Thr 
595 600 605 

Cys Gly Thr Gly Thr Thr Gly Gly Gly Gly Gly Cys Cys Cys Thr Thr 
610 615 620 

Cys Cys Gly Gly Ala Gly Gly Ala Cys Ala Gly Ala Cys Ala Cys Ala 
625 630 635 640 

Thr cys Gly Ala Cys Cys Gly Cys Cys Thr Gly Gly Cys Cys Ala Ala 
645 650 655 

Ala Cys Gly Cys Cys Ala Gly Cys Gly Cys Cys Cys Cys Gly Gly Cys 
660 665 * 670 

Gly Ala Gly Cys Gly Gly Cys Thr Thr Gly Ala Cys Cys Thr Gly Gly 
675 680 685 

Cys Thr Ala Thr Gly Cys Thr Gly Gly Cys Cys Gly Cys Gly Ala Thr 
690 695 700 

Thr Cys Gly Cys Cys Gly Cys Gly Thr Thr Thr Ala Cys Gly Gly Gly 
705 710 715 720 

Cys Thr Gly Cys Thr Thr Gly Cys Cys Ala Ala Thr Ala Cys Gly Gly 
725 730 735 
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Thr Gly Cys Gly Gly Thr Ala Thr Cys Thr Gly Cys Ala Gly Gly Gly 
740 745 750 



Cys Gly Gly Cys Gly Gly Gly Thr Cys Gly Thr Gly Gly Thr Gly Gly 
755 760 765 



Gly Ala Gly Gly Ala Thr Thr Gly Gly Gly Gly Ala Cys Ala Gly Cys 
770 775 780 



Thr Thr Thr Cys Gly Gly Gly Gly Ala Cys Gly Gly Cys Cys Gly Thr 
785 790 795 800 



Gly Cys Cys Gly Cys Cys Cys Cys Ala Gly Gly Gly Thr Gly Cys Cys 
805 810 815 



Gly Ala Gly Cys Cys Cys Cys Ala Gly Ala Gly Cys Ala Ala Cys Gly 
820 825 830 



Cys Gly Gly Gly Cys Cys Cys Ala Cys Gly Ala cys cys Cys Cys Ala 
835 840 845 



Thr Ala Thr Cys Gly Gly Gly Gly Ala Cys Ala Cys Gly Thr Thr Ala 
850 855 860 



Thr Thr Thr Ala Cys Cys Cys Thr Gly Thr Thr Thr Cys Gly Gly Gly 
865 870 875 880 



Cys Cys Cys Cys Cys Gly Ala Gly Thr Thr Gly Cys Thr Gly Gly Cys 
885 890 895 



Cys Cys Cys Cys Ala Ala Cys Gly Gly Cys Gly Ala Cys Cys Thr Gly 
900 905 910 



Thr Ala Thr Ala Ala Cys Gly Thr Gly Thr Thr Thr Gly Cys Cys Thr 
915 920 925 



Gly Gly Gly Cys Cys Thr Thr Gly Gly Ala Cys Gly Thr Cys Thr Thr 



Gly Gly Cys Cys Ala Ala Ala Cys Gly Cys Cys Thr cys Cys Gly Thr 
945 950 955 960 



Cys Cys Cys Ala Thr Gly Cys Ala Cys Gly Thr cys Thr Thr Thr Ala 
965 970 975 



Thr cys Cys Thr Gly Gly Ala Thr Thr Ala Cys Gly Ala Cys Cys Ala 



930 



935 



940 



980 



985 



990 
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Ala Thr Cys Gly Cys Cys Cys Gly Cys Cys Gly Gly Cys Thr Gly Cys 
995 1000 1005 

Cys Gly Gly Gly Ala Cys Gly Cys Cys Cys Thr Gly Cys Thr Gly 
1010 1015 1020 

Cys Ala Ala Cys Thr Thr Ala Cys Cys Thr Cys Cys Gly Gly Gly 
1025 1030 1035 

Ala Thr Gly Gly Thr Cys Cys Ala Gly Ala Cys Cys Cys Ala Cys 
1040 1045 1050 

Gly Thr Cys Ala Cys Cys Ala Cys Cys Cys Cys Ala Gly Gly Cys 
1055 1060 1065 

Thr Cys Cys Ala Thr Ala Cys Cys Gly Ala Cys Gly Ala Thr Cys 
1070 1075 1080 

Thr Gly cys Gly Ala Cys Cys Thr Gly Gly Cys Gly cys Gly Cys 
1085 1090 1095 

Ala Cys Gly Thr Thr Thr Gly Cys Cys Cys Gly Gly Gly Ala Gly 
1100 1105 1110 

Ala Thr Gly Gly Gly Gly Gly Ala Gly Gly Cys Thr Ala Ala Cys 
1115 1120 1125 

<210> 177 
<211> 1128 
<212> DNA 

<213> artificial sequence 
<220> 

<223> HSVl-tk (herpes simplex virus type-1 thymidine kinase) sequence-3 
<400> 177 

atggcttcgt accccggcca tcaacacgcg tctgcgttcg accaggctgc gcgttctcgc 60 
ggccatagca accgacgtac ggcgttgcgc cctcgccggc agcaagaagc cacggaagtc 120 
cgcccggagc agaaaatgcc cacgctactg cgggtttata tagacggtcc ccacgggatg 180 
gggaaaacca ccaccacgca actgctggtg gccctgggtt cgcgcgacga tatcgtctac 240 
gtacccgagc cgatgactta ctggcgggtg ctgggggctt ccgagacaat cgcgaacatc 300 
tacaccacac aacaccgcct cgaccagggt gagatatcgg ccggggacgc ggcggtggta 360 
atgacaagcg cccagataac aatgggcatg ccttatgccg tgaccgacgc cgttctggct 420 
cctcatatcg ggggggaggc tgggagctca catgccccgc ccccggccct caccctcatc 480 
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cgccctcctg tgctacccgg ccgcgcggta 


ccttatgggc 


540 


agcatgaccc 


cccaggccgt 


gctggcgttc gtggccctca 


tcccgccgac 


cttgcccggc 


600 


accaacatcg 


tgcttggggc 


ccttccggag gacagacaca 


tcgaccgcct 


ggccaaacgc 


660 


cagcgccccg 


gcgagcggct 


ggacctggct atgctggctg 


cgattcgccg 


cgtttacggg 


720 


ctacttgcca 


atacggtgcg 


gtatctgcag tgcggcgggt 


cgtggcggga 


ggactgggga 


780 


cagctttcgg 


ggacggccgt 


gccgccccag ggtgccgagc 


cccagagcaa 


cgcgggccca 


840 


cgaccccata 


tcggggacac 


gttatttacc ctgtttcggg 


cccccgagtt 


gctggccccc 


900 


aacaacaacc 


tatataarat 


gtttgcctgg gccttggacg 


tcttggccaa acgcctccgt 


960 


tccatgcacg 


tctttatcct 


ggattacgac caatcgcccg 


ccggctgccg 


ggacgccctg 


1020 


ctgcaactta 


cctccgggat 


ggtccagacc cacgtcacca 


cccccggctc 


cataccgacg 


1080 


atatgcgacc 


tggcgcgcac 


gtttgcccgg gagatggggg 


aggctaac 




1128 



<210> 178 

<211> 1128 

<212> DNA 

<213> artificial sequence 
<220> 

<223> HSVl-tk (herpes simplex virus type-1 thymidine kinase) sequence-4 



<400> 178 
atggcttcgt 


acccctgcca 


tcaacacgcg 


tctgcgttcg 


accaggctgc 


gcgttctcgc 


60 


ggccatagca 


accgacgtac 


ggcgttgcgc 


cctcgccggc 


agcaagaagc 


cacggaagtc 


120 


cgcctggagc 


agaaaatgcc 


cacgctactg 


cgggtttata 


tagacggtcc 


tcacgggatg 


180 


gggaaaacca 


ccaccacgca 


actgctggtg 


gccctgggtt 


cgcgcgacga 


tatcgtctac 


240 


gtacccgatc 


cgatgactta 


ctggcaggtg 


ctgggggctt 


ccgagacaat 


cgcgaacatc 


300 


tacaccacac 


aacaccgcct 


cgaccagggt 


gagatatcgg 


ccggggacgc 


ggcggtggta 


360 


atgacaagcg 


cccagataac 


aatgggcatg 


ccttatgccg 


tgaccgacgc 


cgttctggct 


420 


cctcatatcg 


ggggggaggc 


tgggagctca 


catgccccgc 


ccccggccct 


caccctcatc 


480 


ttcgaccgcc 


atcccatcgc 


cgccctcctg 


tgctacccgg 


ccgcgcgata 


ccttatgggc 


540 


agcatgaccc 


cccaggccgt 


gctggcgttc 


gtggccctca 


tcccgccgac 


cttgcccggc 


600 


acaaacatcg 


tgttgggggc 


ccttccggag 


gacagacaca 


tcgaccgcct 


ggccaaacgc 


660 


cagcgccccg 


gcgagcggct 


tgacctggct 


atgctggccg 


cgattcgccg 


cgtttacggg 


720 


ctgcttgcca 


atacggtgcg 


gtatctgcag 


ggcggcgggt 


cgtggcggga 


ggattgggga 


780 


cagctttcgg 


ggacggccgt 


gccgccccag 


ggtgccgagc 


cccagagcaa 


cgcgggccca 


840 


cgaccccata 


tcggggacac 


gttatttacc 


ctgtttcggg 


cccccgagtt 


gctggccccc 


900 


aacggcgacc 


tgtataacgt 


gtttgcctgg 


gccttggacg tcttggccaa 
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cccatgcacg tctttatcct ggattacgac caatcgcccg ccggctgccg ggacgccctg 1020 

ctgcaactta cctccgggat ggtccagacc cacgtcacca ccccaggctc cataccgacg 1080 

atctgcgacc tggcgcgcac gtttgcccgg gagatggggg aggctaac 1128 

<210> 179 

<211> 1128 

<212> DNA 

<213> artificial sequence 
<220> 

<223> HSVl-tk (herpes simplex virus type-1 thymidine kinase) sequence-5 



<400> 179 
atggcttcgt 


accccggcca 


tcagcacgcg 


tctgcgttcg 


accaggctgc 


gcgttctcgc 


60 


ggccatagca 


accgacgtac 


ggcgttgcgc 


cctcgccggc 


agcaagaagc 


cacggaagtc 


120 


cgaccggagc 


agaaaatgcc 


cacgctactg 


cgggtttata 


tagacggtcc 


ccacgggatg 


180 


gggaaaacca 


ccaccacgca 


actgctggtg 


gccctgggtt 


cgcgcgacga 


tatcgtctac 


240 


gtacccgagc 


cgatgactta 


ctggcgggtg 


ttcggggctt 


ccgagacaat 


cgcgaacatc 


300 


tacaccacac 


aacaccgcct 


cgaccagggt 


gagatatcgg 


ccggggacgc 


ggcggtggta 


360 


atgacaagcg 


cccagataac 


aatgggcatg 


ccttatgccg 


tgaccgacgc 


cgttctggct 


420 


cctcatatcg 


ggggggaggc 


tgggagctca 


catgccccgc 


ccccggccct 


caccctcatc 


480 


ttcgaccgcc 


atcccatcgc 


cgccctcctg 


tgctacccgg 


ccgcgcgata 


ccttatgggc 


540 


agcatgaccc 


cccaggccgt 


gctggcgttc 


gtggccctca 


tcccgccgac 


cttgcccggc 


600 


acaaacatcg 


tgttgggggc 


ccttccggag 


gacagacaca 


tcgaccgcct 


ggccaaacgc 


660 


cagcgccccg 


gcgagcggct 


tgacctggct 


atgctggccg 


cgattcgccg 


cgtttacggg 


720 


ctgcttgcca 


atacggtgcg 


gtatctgcag 


ggcggcgggt 


cgtggcggga 


ggattgggga 


780 


cagctttcgg 


ggacggccgt 


gccgccccag 


ggtgccgagc 


cccagagcaa 


cgcgggccca 


840 


cgaccccaca 


tcggggacac 


gttatttacc 


ctgtttcggg 


cccccgagtt 


gctggccccc 


900 


aacggcgacc 


tgtataacgt 


gtttgcctgg 


gccttggacg 


tcttggccaa 


acgcctccgt 


960 


cccatgcacg 


tctttatcct 


ggattacgac 


caatcgcccg 


ccggctgccg 


ggacgccctg 


1020 


ctgcaactta 


cctccgggat 


ggtccagacc 


cacgtcacca 


ccccaggctc 


cataccgacg 


1080 


atctgcgacc 


tggcgcgcac 


gtttgcccgg 


gagatggggg 


aggctaac 




1128 



<210> 180 

<211> 1128 

<212> DNA 

<213> artificial sequence 
<220> 

<223> HSVl-tk (herpes simplex virus type-1 thymidine kinase) sequence-6 
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<400> 180 
atggcttcgt 


acccctgcca 


tcaacacgcg 


tctgcgttcg 


accaggctgc 


gcgttctcgc 


60 


ggccatagca 


accgacgtac 


ggcgttgcgc 


cctcgccggc 


agcaagaagc 


cacggaagtc 


120 


cgcctggagc 


agaaaatgcc 


cacgctactg 


cgggtttata 


tagacggtcc 


tcacgggatg 


180 


gggaaaacca 


ccaccacgca 


actgctggtg 


gccctgggtt 


cgcgcgacga 


tatcgtctac 


240 


gtacccgagc 


cgatgactta 


ctggcaggtg 


ctgggggctt 


ccgagacaat 


cgcgaacatc 


300 


tacaccacac 


aacaccgcct 


cgaccagggt 


gagatatcgg 


ccggggacgc 


ggcggtggta 


360 


atgacaagcg 


cccagataac 


aatgggcatg 


ccttatgccg 


tgaccgacgc 


cgttctggct 


420 


cctcatatcg 


ggggggaggc 


tgggagctca 


catgccccgc 


ccccggccct 


caccctcatc 


480 


ttcgaccgcc 


atcccatcgc 


cgccctcctg 


tgctacccgg 


ccgcgcgata 


ccttatgggc 


540 


agcatgaccc 


cccaggccgt 


gctggcgttc 


gtggccctca 


tcccgccgac 


cttgcccggc 


600 


acaaacatcg 


tgttgggggc 


ccttccggag 


gacagacaca 


tcgaccgcct 


ggccaaacgc 


660 


cagcgccccg 


gcgagcggct 


tgacctggct 


atgctggccg 


cgattcgccg 


cgtttacggg 


720 


ctgcttgcca 


atacggtgcg 


gtatctgcag 


ggcggcgggt 


cgtggcggga 


ggattgggga 


780 


cagctttcgg 


ggacggccgt 


gccgccccag 


ggtgccgagc 


cccagagcaa 


cgcgggccca 


840 


LyciLLLLciLci 


LLyyyydLdL 


nttatttarr 

y L La L L LaLL 


rtntttrnnn 
l i_y u u LLyyy 




y L LyyLLLLL 


Qnn 


aacggcgacc 


tgtataacgt 


gtttgcctgg 


gccttggacg 


tcttggccaa 


acgcctccgt 


960 


cccatgcacg 


tctttatcct 


ggattacgac 


caatcgcccg 


ccggctgccg 


ggacgccctg 


1020 


ctgcaactta 


cctccgggat 


ggtccagacc 


cacgtcacca 


ccccaggctc 


cataccgacg 


1080 


atctgcgacc 


tggcgcgcac 


gtttgcccgg 


gagatggggg 


aggctaac 




1128 



<210> 181 
<211> 1128 
<212> DNA 

<213> artificial sequence 
<220> 

<223> HSVl-tk (herpes simplex virus type-1 thymidine kinase) sequence-7 
<400> 181 

atggcttcgt acccctgcca tcaacacgcg tctgcgttcg accaggctgc gcgttctcgc 60 

ggccatagca accgacgtac ggcgttgcgc cctcgccggc agcaagaagc cacggaagtc 120 

cgcctggagc agaaaatgcc cacgctactg cgggtttata tagacggtcc tcacgggatg 180 

gggaaaacca ccaccacgca actgctggtg gccctgggtt cgcgcgacga tatcgtctac 240 

gtacccgagc cgatgactta ctggcaggtg ctgggggctt ccgagacaat cgcgaacatc 300 

tacaccacac aacaccgcct cgaccagggt gagatatcgg ccggggacgc ggcggtggta 360 

atgacaagcg cccagataac aatgggcatg ccttatgccg tgaccgacgc cgttctggct 420 
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cctcatatcg 


ggggggaggc 


tgggagctca 


catgccccgc 


ccccggccct 


caccctcatc 


480 


ttcgaccgcc 


atcccatcgc 


cgccctcctg 


tgctacccgg 


ccgcgcgata ccttatgggc 


540 


agcatgaccc 


cccaggccgt 


gctggcgttc 


gtggccctca 


tcccgccgac 


cttgcccggc 


600 


acaaacatcg 


tgttgggggc 


ccttccggag 


gacagacaca 


tcgaccgcct ggccaaacgc 


660 


cagcgccccg 


gcgagcggct 


tgacctggct 


atgctggccg 


cgattcgccg 


cgtttacggg 


720 


ctgcttgcca 


atacggtgcg 


gtatctgcag 


ggcggcgggt 


cgtggcggga 


ggattgggga 


780 


cagctttcgg 


ggacggccgt 


gccgccccag 


ggtgccgagc 


cccagagcaa 


cgcgggccca 


840 


LydLLLLa Ld. 


trnnnnarar 

Lcyyyyac_d.(_ 


nttatttarr 


t. ty l l Lcyyy 


cccccgagtt 


gctggccccc 




aacggcgacc 


tgtataacgt 


gtttgcctgg 


gccttggacg 


tcttggccaa 


acgcctccgt 


960 


cccatgcacg 


tctttatcct 


ggattacgac 


caatcgcccg 


ccggctgccg 


ggacgccctg 


1020 


ctgcaactta 


cctccgggat 


ggtccagacc 


cacgtcacca 


ccccaggctc 


cataccgacg 


1080 


atctgcgacc 


tggcgcgcac 


gtttgcccgg 


gagatggggg 


aggctaac 




1128 



<210> 182 

<211> 1128 

<212> DNA 

<213> artificial sequence 
<220> 

<223> HSVl-tk (herpes simplex virus type-1 thymidine kinase) sequence-8 



<400> 182 
atggcttcgt 


acccctgcca 


tcaacacgcg 


tctgcgttcg 


accaggctgc 


gcgttctcgc 


60 


ggccatagca 


accgacgtac 


ggcgttgcgc 


cctcgccggc 


agcaagaagc 


cacggaagtc 


120 


cgcctggagc 


agaaaatgcc 


cacgctactg 


cgggtttata 


tagacggtcc 


tcacgggatg 


180 


gggaaaacca 


ccaccacgca 


actgctggtg 


gccctgggtt 


cgcgcgacga 


tatcgtctac 


240 


gtacccgagc 


cgatgactta 


ctggcaggtg 


ctgggggctt 


ccgagacaat 


cgcgaacatc 


300 


tacaccacac 


aacaccgcct 


cgaccagggt 


gagatatcgg 


ccggggatgc 


ggcggtggta 


360 


atgacaagcg 


cccagataac 


aatgggcatg 


ccttatgccg 


tgaccgacgc 


cgttctggct 


420 


cctcatatcg 


ggggggaggc 


tgggagctca 


catgccccgc 


ccccggccct 


caccctcatc 


480 


ttcgaccgcc 


atcccatcgc 


cgccctcctg 


tgctacccgg 


ccgcgcgata 


ccttatgggc 


540 


agcatgaccc 


cccaggccgt 


gctggcgttc 


gtggccctca 


tcccgccgac 


cttgcccggc 


600 


acaaacatcg 


tgttgggggc 


ccttccggag 


gacagacaca 


tcgaccgcct 


ggccaaacgc 


660 


cagcgccccg 


gcgagcggct 


tgacctggct 


atgctggccg 


cgattcgccg 


cgtttacggg 


720 


ctgcttgcca 


atacggtgcg 


gtatctgcag 


ggcggcgggt 


cgtggcggga 


ggattgggga 


780 


cagctttcgg 


ggacggccgt 


gccgccccag 


ggtgccgagc 


cccagagcaa 


cgcgggccca 


840 
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cgaccccata tcggggacac gttatttacc ctgtttcggg cccccgagtt gctggccccc 900 

aacggcgacc tgtataacgt gtttgcctgg gccttggacg tcttggccaa acgcctccgt 960 

cccatgcacg tctttatcct ggattacgac caatcgcccg ccggctgccg ggacgccctg 1020 

ctgcaactta cctccgggat ggtccagacc cacgtcacca ccccaggctc cataccgacg 1080 

atctgcgacc tggcgcgcac gtttgcccgg gagatggggg aggctaac 1128 

<210> 183 

<211> 1128 

<212> DNA 

<213> artificial sequence 
<220> 

<223> HSVl-tk (herpes simplex virus type-1 thymidine kinase) sequence-9 



<400> 183 
atggcttcgt 


acccctgcca 


tcaacacgcg 


tctgcgttcg 


accaggctgc 


gcgttctcgc 


60 


qgccataaca 


accgacgtac 


ggcgttgcgc 


cctcgccggc 


agcaagaagc 


cacggaagtc 


120 


cgcctggagc 


agaaaatgcc 


cacgctactg 


cgggtttata 


tagacggtcc 


tcacgggatg 


180 


gggaaaacca 


ccaccacgca 


actqctqqtq 


gccctgggtt 


cgcgcgacga 


tatcgtctac 


240 


gtacccgagc 


cgatgactta 


ctqqcaqqtq 


ctgggggctt 


ccgagacaat 


cgcgaacatc 


300 


tacaccacac 


aacaccgcct 


cgaccagggt 


gagatatcgg 


ccggggatgc 


ggcggtggta 




atgacaagcg 


cccagataac 


aatgggcatg 


ccttatgccg 


taaccgacgc 


cgttctggct 


420 


cctcatatcg 


ggggggaggc 


tgggagctca 


catgccccgc 


ccccggccct 


caccctcatc 


480 


ttcgaccgcc 


atcccatcgc 


cgccctcctg 


tgctacccgg 


ccgcgcgata 


ccttatgggc 


540 


agcatgaccc 


cccaggccgt 


gctggcgttc 


ttggccctca tcccgccgac 


cttgcccggc 


600 


acaaacatcg 


tgttgggggc 


ccttccggag 


gacagacaca 


tcgaccgcct 


ggccaaacgc 


660 


cagcgccccg 


gcgagcggct 


tgacctggct 


atgctggccg 


cgattcgccg 


cgtttacggg 


720 


ctgcttgcca 


atacggtgcg 


gtatctgcag 


ggcggcgggt 


cgtggcggga 


ggattgggga 


780 


cagctttcgg 


ggacggccgt 


gccgccccag 


ggtgccgagc 


cccagagcaa 


cgcgggccca 


840 


cgaccccata 


tcggggacac 


gttatttacc 


ctgtttcggg 


cccccgagtt 


gctggccccc 


900 


aacggcgacc 


tgtataacgt 


gtttgcctgg 


gccttggacg 


tcttggccaa 


acgcctccgt 


960 


cccatgcacg 


tctttatcct 


ggattacgac 


caatcgcccg 


ccggctgccg 


ggacgccctg 


1020 


ctgcaactta 


cctccgggat 


ggtccagacc 


cacgtcacca 


ccccaggctc 


cataccgacg 


1080 


atctgcgacc 


tggcgcgcac 


gtttgcccgg 


gagatggggg aggctaac 




1128 



<210> 184 
<211> 1128 
<212> DNA 

<213> artificial sequence 
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<220> 

<223> HSVl-tk (herpes simplex virus type-1 thymidine kinase) 
sequence-10 



<400> 184 
atggcttcgt 


acccctgcca 


tcaacacgcg 


tctgcgttcg 


accaggctgc 


gcgttctcgc 


60 


ggccatagca 


accgacgtac 


ggcgttgcgc 


cctcgccggc 


agcaagaagc 


cacggaagtc 


120 


cacctggagc 


agaaaatgcc 


cacgctactg 


cgggtttata 


tagacggtcc 


tcacgggatg 


180 


gggaaaacca 


ccaccacgca 


actgctggtg 


gccctgggtt 


cgcgcgacga tatcgtctac 


240 


gtacccgagc 


cgatgactta 


ctggcaggtg 


ctgggggctt 


ccgagacaat 


cgcgaacatc 


300 


tacaccacac 


aacaccgcct 


cgaccagggt 


gagatatcgg 


ccggggacgc ggcggtggta 


360 


atgacaagcg 


cccagataac 


aatgggcatg 


ccttatgccg 


tgaccgacgc 


cgttctggct 


420 


cctcatatcg 


ggggggaggc 


tgggagctca 


catgccccgc 


ccccggccct 


caccctcatc 


480 


ttcgaccgcc 


atcccatcgc 


cgccctcctg 


tgctacccgg 


ccgcgcgata 


ccttatgggc 


540 


agcatgaccc 


cccaggccgt 


gctggcgttc 


gtggccctca 


tcccgccgac 


cttgcccggc 


600 


acaaacatcg 


tgttgggggc 


ccttccggag 


gacagacaca 


tcgaccgcct 


ggccaaacgc 


660 


cagcgccccg 


gcgagcggct 


tgacctggct 


atgctggccg 


cgattcgccg cgtttacggg 


720 




a Ldv^y y Lyuy 


ntatrtnran 


ggcggcgggx 


cgtggcggga ggattgggga 


/ ou 


cagctttcgg 


ggacggccgt 


gccaccccag 


ggtgccgagc 


cccagagcaa 


cgcgggccca 


840 


cgaccccata 


tcggggacac 


gttatttacc 


ctgtttcggg 


cccccgagtt 


gctggccccc 


900 


aacggcgacc 


tgtataacgt 


gtttgcctgg 


gccttggacg 


tcttggccag 


acgcctccgt 


960 


cccatgcacg 


tctttatcct 


ggattacgac 


caatcgcccg 


ccggctgccg 


ggacgccctg 


1020 


ctgcaactta 


cctccgggat 


ggtccagacc 


cacgtcacca 


ccccaggctc 


cataccgacg 


1080 


atctgcgacc 


tggcgcgcac 


gtttgcccgg 


gagatggggg 


aggctaac 




1128 



<210> 185 

<211> 1128 

<212> DNA 

<213> artificial sequence 
<220> 

<223> HSVl-tk (herpes simplex virus type-1 thymidine kinase) 
sequence-11 

<400> 185 

atggcttcgt acccctgcca tcaacacgcg tctgcgttcg accaggctgc gcgttctcgc 60 

ggccatagca accgacgtac ggcgttgcgc cctcgccggc agcaagaagc cacggaagtc 120 

cgcctggagc agaaaatgcc cacgctactg cgggtttata tagacggtcc tcacgggatg 180 

gggaaaacca ccaccacgca actgctggtg gccctgggtt cgcgcgacga tatcgtctac 240 
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gtacccgagc 


cgatgactta 


ctggcaggtg 


ctgggggctt 


ccgagacaat 


cgcgaacatc 


300 


tacaccacac 


aacaccgcct 


cgaccagggt 


gagatatcgg 


ccggggacgc 


ggcggtggta 


360 


atgacaagcg 


cccagataac 


aatgggcatg 


ccttatgccg 


tgaccgacgc 


cgttctggct 


420 


cctcatatcg 


ggggggaggc 


tgggagctca 


catgccccgc 


ccccggccct 


caccctcatc 


480 


ttcgaccgcc 


atcccatcgc 


cgccctcctg 


tgctacccgg 


ccgcgcgata 


ccttatgggc 


540 


agcatgaccc 


cccaggccgt 


gctggcgttc 


gtggccctca 


tcccgccgac 


cttgcccggc 


600 


acaaacatcg 


tgttgggggc 


ccttccagag 


gacaaacaca 


tcgaccgcct 


ggccaaacgc 


660 


cagcgccccg 


gcgagcggct 


tgacctggct 


atgctggccg 


cgattcgccg 


cgtttacggg 


720 


ctgcttgtca 


atacggtgcg 


gtatctgcag 


ggcggcgggt 


cgtggcggga 


ggattgggga 


780 


cagctttcgg 


ggacggccgt 


gccgccccag 


ggtgccgagc 


cccagagcaa 


cgcgggccca 


840 


cgaccccata 


tcggggacac 


gttatttacc 


ctgtttcggg 


cccccgagtt 


gctggccccc 


900 


aacggcgacc 


tgtataacgt 


gtttgcctgg 


gccttggacg 


tcttggccaa 


acgcctccgt 


960 


cccatgcacg 


tctttatcct 


ggattacgac 


caatcgcccg 


ccggctgccg 


ggacgccctg 


1020 


ctgcaactta 


cctccgggat 


ggtccagacc 


cacgtcacca 


ccccaggctc 


cataccgacg 


1080 


atctgcgacc 


tggcgcgcac 


gtttgcccgg 


gagatggggg 


aggctaac 




1128 



<210> 186 

<211> 1128 

<212> DNA 

<213> artificial sequence 
<220> 

<223> HSVl-tk (herpes simplex virus type-1 thymidine kinase) 
sequence-12 



<400> 186 
atggcttcgt 


acccctgcca 


tcaacacgcg 


tctgcgttcg 


accaggctgc 


gcgttctcgc 


60 


ggccatagca 


accgacgtac 


ggcgttgcgc 


cctcgccggc 


agcaagaagc 


cacggaagtc 


120 


cgcctggagc 


agaaaatgcc 


cacgctactg 


cgggtttata 


tagacggtcc 


tcacgggatg 


180 


gggaaaacca 


ccaccacgca 


actgctggtg 


gccctgggtt 


cgcgcgacga 


tatcgtctac 


240 


gtacccgagc 


cgatgactta 


ctggcaggtg 


ctgggggctt 


ccgagacaat 


cgcgaacatc 


300 


tacaccacac 


aacaccgcct 


cgaccagggt 


gagatatcgg 


ccggggacgc 


ggcggtggta 


360 


atgacaagcg 


cccagataac 


aatgggcatg 


ccttatgccg 


tgaccgacgc 


cgttctggct 


420 


cctcatatcg 


ggggggaggc 


tgggagctca 


catgccccgc 


ccccggccct 


caccctcatc 


480 


ttcgaccgcc 


atcccatcgc 


cgccctcctg 


tgctacccgg 


ccgcgcgata 


ccttatgggc 


540 


agcatgaccc 


cccaggccgt 


gctggcgttc 


gtggccctca 


tcccgccgac 


cttgcccggc 


600 


acaaacatcg 


tgttgggggc 


ccttccggag 


gacagacaca 


tcgaccgcct 


ggccaaacgc 


660 
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cagcgccccg gcgagcggct 


tgacctggct 


atgctggccg 


cgattcgccg 


cgtttacggg 


720 


ctgcttgcca atacggtgcg gtatctgcag ggcggcgggt 


cgtggcggga ggattgggga 


780 


caactftcaa 


aaacaaccat 


accaccccaa 


ggtgccgagc 


cccagagcaa 


cgcgggccca 


840 


cgaccccata 


tcggggacac 


gttatttacc 


ctgtttcggg 


cccccgagtt 


gctggcaccc 


900 


aacggcgacc 


tgtataacgt 


gtttgcctgg 


gccttggacg 


tcttggccaa acgcctccgt 


you 


cccatgcacg 


tctttatcct 


ggattacgac 


caatcgcccg 


ccggctgccg 


ggacgccctg 


1020 


ctgcaactta 


cctccgggat 


ggtccagacc 


cacgtcacca 


ccccaggctc 


cataccgacg 


1080 


atctgcgacc 


tggcgcgcac 


gtttgcccgg 


gagatggggg 


aggctaac 




1128 


<210> 187 
<211> 1128 
<212> DNA 

<213> artificial sequence 










<220> 

<223> HSVl-tk (herpes simplex virus type-1 
sequence-13 


thymidine kinase) 




<400> 187 
atggcttcgt 


acccctgcca 


tcaacacgcg 


tctgcgttcg 


accaggctgc 


gcgttctcgc 


60 


cjgccatagca 


accgacgtac 


ggcgttgcgc 


cctcgccggc 


agcaagaagc 


cacggaagtc 


120 


cgcctggagc 


agaaaatgcc 


cacgctactg 


cgggtttata 


tagacggtcc 


tcacgggatg 


180 


gggaaaacca 


ccaccacgca 


actgctggtg 


gccctgggtt 


cgcgcgacga 


tatcgtctac 


240 


gtacccgagc 


cgatgactta 


ctggcaggtg 


ctgggggctt 


ccgagacaat 


cgcgaacatc 


300 


tacaccacac 


aacaccgcct 


cgaccagggt 


gagatatcgg 


ccggggacgc 


ggcggtggta 


360 


atgacaagcg 


cccagataac 


aatgggcatg 


ccttatgccg 


tgaccgacgc 


cgttctggct 


420 


cctcatatcg 


ggggggaggc 


tgggagctca 


catgccccgc 


ccccggccct 


caccctcatc 


480 


ttcgaccgcc 


atcccatcgc 


cgccctcctg 


tgctacccgg 


ccgcgcgata 


ccttatgggc 


540 


agcatgaccc 


cccaggccgt 


gctggcgttc 


gtggccctca 


tcccgccgac 


cttgcccggc 


600 


acaaacatcg 


tgttgggggc 


ccttccggag 


gacagacaca 


ttgaccgcct 


ggccaaacgc 


660 


cagcgccccg 


gcgagcggct 


tgacctggct 


atgctagccg 


cgattcgccg 


cgtttacggg 


720 


ctgcttgcca 


atacggtgcg 


gtatctgcag 


ggcggcgggt 


cgtggcggga 


ggattgggga 


780 


cagctttcgg 


ggacggccgt 


gccgccccag 


ggtgccgagc 


cccagagcaa 


cgcgggccca 


840 


cgaccccata 


tcggggacac 


gttatttacc 


ctgtttcggg 


cccccgagtt 


gctggccccc 


900 


aacggcgacc 


tgtataacgt 


gtttgcctgg 


gccttggacg 


tcttggccaa 


acgcctccgt 


960 


cccatgcacg 


tctttatcct 


ggattacgac 


caatcgcccg 


ccggctgccg 


ggacgccctg 


1020 


ctgcaactta 


cctccgggat 


ggtccagacc 


cacgtcacca 


ccccaggctc 


cataccgacg 


1080 
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atctgcgacc tggcgcgcac gtttgcccgg gagatggggg aggctaac 



1128 



<210> 188 

<211> 1128 

<212> DNA 

<213> artificial sequence 
<220> 

<223> HSVl-tk (herpes simplex virus type-1 thymidine kinase) 
sequence-14 



<400> 188 
atggcttcgt 


acccctgcca 


tcaacacgcg 


tctgcgttcg 


accaggctgc 


gcgttctcgc 


60 


ggccatagca 


accgacgtac 


ggcgttgcgc 


cctcgccggc 


agcaagaagc 


cacggaagtc 


120 


cgcctggagc 


agaaaatgcc 


cacgctactg 


cgggtttata 


tagacggtcc 


tcacgggatg 


180 


gggaaaacca 


ccaccacgca 


actgctggtg 


gccctgggtt 


cgcgcgacga 


tatcgtctac 


240 


gtacccgagc 


cgatgactta 


ctggcaggtg 


ctgggggctt 


ccgagacaat 


cgcgaacatc 


300 


tacaccacac 


aacaccgcct 


cgaccagggt 


gagatatcgg 


ccggggacgc ggcggtggta 


360 


atgacaagcg 


cccagataac 


aatgggcatg 


ccttatgccg 


tgaccgacgc 


cgttctggct 


420 


cctcatatcg 


ggggggaggc 


tgggagctca 


catgccccgc 


ccccggccct 


caccctcatc 


480 


ttcgaccgcc 


atcccatcgc 


cgccctcctg 


tgctacccgg 


ccgcgcgata 


ccttatgggc 


540 


agcatgaccc 


cccaggccgt 


gctggcgttc 


gtggccctca 


tcccgccgac 


cttgcccggc 


600 


acaaacatcg 


tgttgggggc 


ccttccggag 


gacagacaca 


ttgaccgcct 


ggccaaacgc 


660 


cagcgccccg 


gcgagcggct 


tgacctggct 


atgctggccg 


cgattcgccg 


cgtttacggg 


720 


ctgcttgcca 


atacggtgcg 


gtatctgcag 


ggcggcgggt 


cgtggcggga ggattgggga 


780 


cagctttcgg 


ggacggccgt 


gccgccccag 


ggtgccgagc 


cccagagcaa 


cgcgggccca 


840 


cgaccccata 


tcggggacac 


gttatttacc 


ctgtttcggg 


cccccgagtt 


gctggccccc 


900 


aacggcgacc 


tgtataacgt 


gtttgcctgg 


gccttggacg 


tcttggccaa 


acgcctccgt 


960 


cccatgcacg 


tctttatcct 


ggattacgac 


caatcgcccg 


ccggctgccg 


ggacgccctg 


1020 


ctgcaactta 


cctccgggat 


ggtccagacc 


cacgtcacca 


ccccaggctc 


cataccgacg 


1080 


atctgcgacc 


tggcgcgcac 


gtttgcccgg 


gagatggggg 


aggctaac 




1128 



<210> 189 

<211> 1128 

<212> DNA 

<213> artificial sequence 
<220> 

<223> HSVl-tk (herpes simplex virus type-1 thymidine kinase) 
sequence-15 

<400> 189 

atggcttcgt acccctgcca tcaacacgcg tctgcgttcg accaggctgc gcgttctcgc 60 
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ggccatagca 


accgacgtac 


ggcgttgcgc 


cctcgccggc 


agcaagaagc 


cacggaagtc 


120 


cgcctggagc 


agaaaatgcc 


cacgctactg 


cgggtttata tagacggtcc 


tcacgggatg 


180 


gggaaaacca 


ccaccacgca 


actgctggtg 


gccctgggtt 


cgcgcgacga 


tatcgtctac 


240 


gtacccgagc 


cgatgactta 


ctggcaggtg 


ctgggggctt 


ccgagacaat 


cgcgaacatc 


300 


tacaccacac 


aacaccgcct 


cgaccagggt 


gagatatcgg ccggggacgc 


ggcggtggta 


360 


atgacaagcg 


cccagataac 


aatgggcatg 


ccttatgccg 


tgaccgacgc 


cgttctggct 


420 


cctcatatcg 


ggggggaggc 


tgggagctca 


catgccccgc 


ccccggccct 


caccctcatc 


480 


ttcgaccgcc 


atcccatcgc 


cgccctcctt 


tgctacccgg 


ccgcgcgata 


ccttatgggc 


540 


agcatgaccc 


cccaggccgt 


gctggcgttc 


gtggccctca 


tcccgccgac 


cttgcccggc 


600 


acaaacatcg 


tgttgggggc 


ccttccggag 


gacagacaca 


ttgaccgcct 


ggccaaacgc 


660 


cagcgccccg 


gcgagcggct 


tgacctggct 


atgctggccg 


cgattcgccg 


cgtttacggg 


720 


ctgcttgcca 


atacggtgcg 


gtatctgcag 


ggcggcgggt 


cgtggcggga 


ggattgggga 


780 


cagctttcgg 


ggacggccgt 


gccgccccag 


ggtgccgagc 


cccagagcaa 


cgcgggccca 


840 




traoonarar 


y i La l l loll 


ctgtttcggg 


cccccgagtt 






aacggcgacc 


tgtataacgt 


gtttgcctgg 


gccttggacg 


tcttggccaa 


acgcctccgt 


960 


cccatgcacg 


tctttatcct 


ggattacgac 


caatcgcccg 


ccggctgccg 


ggacgccctg 


1020 


ctgcaactta 


cctccgggat 


ggtccagacc 


cacgtcacca 


ccccaggctc 


cataccgacg 


1080 


atctgcgacc 


tggcgcgcac 


gtttgcccgg 


gagatggggg 


aggctaac 




1128 



<210> 190 

<211> 1128 

<212> DNA 

<213> artificial sequence 
<220> 

<223> HSVl-tk (herpes simplex virus type-1 thymidine kinase) 
sequence-16 

<400> 190 

atggcttcgt acccctgcca tcaacacgcg tctgcgttcg accaggctgc gcgttctcgc 60 

ggccatagca accgacgtac ggcgttgcgc cctcgccggc agcaagaagc cacggaagtc 120 

cgcctggagc agaaaatgcc cacgctactg cgggtttata tagacggtcc tcacgggatg 180 

gggaaaacca ccaccacgca actgctggtg gccctgggtt cgcgcgacga tatcgtctac 240 

gtacccgagc cgatgactta ctggcaggtg ctgggggctt ccgagacaat cgcgaacatc 300 

tacaccacac aacaccgcct cgaccagggt gagatatcgg ccggggacgc ggcggtggta 360 

atgacaagcg cccagataac aatgggcatg ccttatgccg tgaccgacgc cgttctggct 420 

cctcatatcg ggggggaggc tgggagctca catgccccgc ccccggccct caccctcatc 480 
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ttcgaccgcc 


atcccatcgc 


cgccctcctg 


tgctacccgg 


ccgcgcgata 


ccttatgggc 


540 


agcatgaccc 


cccaggccgt 


gctggcgttc 


gtggccctca tcccgccgac cttgcccggc 


600 


acaaacatcg 


tgttgggggc 


ccttccggag 


gacagacaca 


tcgaccgcct 


ggccaaacgc 


660 


cagcgccccg 


gcgagcggct 


tgacctggct 


atgctagccg 


cgattcgccg 


cgtttacggg 


720 


ctgcttgcca 


atacggtgcg 


gtatctgcag 


ggcggcgggt 


cgtggcggga 


ggattgggga 


780 


cagctttcgg 


ggacggccgt 


gccgccccag 


ggtgccgagc 


cccagagcaa 


cgcgggccca 


840 


L.ydL,v.L.L.dLd 


LL.yyyyciv.civ-. 


nttatttarr 

y L Ld L V. LdUL 


ctgtttcggg 


cccccgagtt 


gctggccccc 




aacggcgacc 


tgtataacgt 


gtttgcctgg 


gccttggacg 


tcttggccaa 


acgcctccgt 


960 


cccatgcacg 


tctttatcct 


ggattacgac 


caatcgcccg 


ccggctgccg 


ggacgccctg 


1020 


ctgcaactta 


cctccgggat 


ggtccagacc 


cacgtcacca 


ccccaggctc 


cataccgacg 


1080 


atctgcgacc 


tggcgcgcac 


gtttgcccgg 


gagatggggg 


aggctaac 




1128 



<210> 191 

<211> 1128 

<212> DNA 

<213> artificial sequence 
<220> 

<223> HSVl-tk (herpes simplex virus type-1 thymidine kinase) 
sequence-17 



<400> 191 
atggcttcgt 


acccctgcca 


tcaacacgcg 


tctgcgttcg 


accaggctgc 


gcgttctcgc 


60 


ggccatagca 


accgacgtac 


ggcgttgcgc 


cctcgccggc agcaagaagc 


cacggaagtc 


120 


cgcctggagc 


agaaaatgcc 


cacgctactg 


cgggtttata tagacggtcc 


tcacgggatg 


180 


gggaaaacca 


ccaccacgca 


actgctggtg 


gccctgggtt 


cgcgcgacga 


tatcgtctac 


240 


gtacccgagc 


cgatgactta 


ctggcaggtg 


ctgggggctt 


ccgagacaat 


cgcgaacatc 


300 


tacaccacac 


aacaccgcct 


cgaccagggt 


gagatatcgg 


ccggggacgc 


ggcggtggta 


360 


atgacaagcg 


cccagataac 


aatgggcatg 


ccttatgccg 


tgaccgacgc 


cgttctggct 


420 


cctcatatcg 


ggggggaggc 


tgggagctca 


catgccccgc 


ccccggccct 


caccctcatc 


480 


ttcgaccgcc 


atcccatcgc 


cgccctcctg 


tgctacccgg 


ccgcgcgata 


ccttatgggc 


540 


agcatgaccc 


cccaggccgt 


gctggcgttc 


gtggccctca tcccgccgac 


cttgcccggc 


600 


acaaacatcg 


tgttgggggc 


ccttccggag 


gacagacaca 


tcgaccgcct 


ggccaaacgc 


660 


cagcgccccg 


gcgagcggct 


tgacctggct 


atgctagccg 


cgattcgccg 


cgtttacggg 


720 


ctgcttgcca 


atacggtgcg 


gtatctgcag 


ggcggcgggt 


cgtggcggga 


ggattgggga 


780 


cagctttcgg 


ggacggccgt 


gccgccccag 


ggtgccgagc 


cccagagcaa 


cgcggaccca 


840 


cgaccccata 


tcggggacac 


gttatttacc 


ctgtttcggg 


cccccgagtt 


gctggccccc 


900 
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aacggcgacc tgtataacgt gtttgcctgg gccttggacg 


tcttggccaa 


acgcctccgt 


960 


cccatgcacg 


tctttatcct 


ggattacgac 


caatcgcccg 


ccggctgccg 


ggacgccctg 


1020 


ctgcaactta 


cctccgggat 


ggtccagacc 


cacgtcacca 


ccccaggctc 


cataccgacg 


1080 


atctgcgacc tggcgcgcac gtttgcccgg gagatggggg 


aggctaac 




1128 


<210> 192 
<211> 1128 
<212> DNA 

<213> artificial sequence 










<220> 

<223> HSVl-tk (herpes simplex virus type-1 
sequence-18 


thymidine kinase) 




<4uu> vsi 
atggcttcgt 


acccctgcca 


tcaacacgcg 


tctgcgttcg 


accaggctgc 


gcgttctcgc 


60 


ggccatagca 


accgacgtac 


ggcgttgcgc 


cctcgccggc 


agcaagaagc 


cacggaagtc 


120 


cgcctggagc 


agaaaatgcc 


cacgctactg 


cgggtttata 


tagacggtcc 


tcacgggatg 


180 


gggaaaacca 


ccaccacgca 


actgctggtg 


gccctgggtt 


cgcgcgacga 


tatcgtctac 


240 


gtacccgagc 


cgatgactta 


ctggcaggtg 


ctgggggctt 


ccgagacaat 


cgcgaacatc 


300 


nacaccacac 


aacaccgcct 


cgaccagggt 


gagatatcgg 


ccggggacgc 


ggcggtggta 


360 


atgacaagcg 


cccagataac 


aatgggcatg 


ccttatgccg 


tgaccgacgc 


cgttctggct 


420 


cctcatatcg 


ggggggaggc 


tgggagctca 


catgccccgc 


ccccggccct 


caccctcatc 


480 


ttcgaccgcc 


atcccatcgc 


cgccctcctg 


tgctacccgg 


ccgcgcgata 


ccttatgggc 


540 


agcatgaccc 


cccaggccgt 


gctggcgttc 


gtggccctca 


tcccgccgac 


cttgcccggc 


600 


acaaacatcg 


tgttgggggc 


ccttccggag 


gacagacaca 


tcgaccgcct 


ggccaaacgc 


660 


cagcgccccg 


gcgagcggct 


tgacctggct 


atgctagccg 


cgattcgccg 


cgtttacggg 


720 


ctgcttgcca 


atacggtgcg 


gtatctgcag 


ggcggcgggt 


cgtggcggga 


ggattgggga 


780 


cagctttcgg 


ggacggccgt 


gccgccccag 


gttgccgagc 


cccagagcaa 


cgcggaccca 


840 


cgaccccata 


tcggggacac 


gttatttacc 


ctgtttcggg 


cccccgagtt 


gctggccccc 




aacggcgacc 


tgtataacgt 


gtttgcctgg 


gccttggacg 


tcttggccaa 


acgcctccgt 


960 


cccatgcacg 


tctttatcct 


ggattacgac 


caatcgcccg 


ccggctgccg 


ggacgccctg 


1020 


ctgcaactta 


cctccgggat 


ggtccagacc 


cacgtcacca 


ccccaggctc 


cataccgacg 


1080 


atctgcgacc 


tggcgcgcac 


gtttgcccgg 


gagatggggg 


aggctaac 




1128 



<210> 193 
<211> 1128 
<212> DNA 

<213> artificial sequence 
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<220> 

<223> HSVl-tk (herpes simplex virus type-1 thymidine kinase) 
sequence-19 



<400> 193 
atggcttcgt 


acccctgcca 


tcaacacgcg 


tctgcgttcg 


accaggctgc 


gcgttctcgc 


60 


ggccatagca 


accgacgtac 


ggcgttgcgc 


cctcgccggc 


agcaagaagc 


cacggaagtc 


120 


cgcctggagc 


agaaaatgcc 


cacgctactg 


cgggtttata 


tagacggtcc 


tcacgggatg 


180 


gggaaaacca 


ccaccacgca 


actgctggtg 


gccctgggtt 


cgcgcgacga 


tatcgtctac 


240 


gtacccgagc 


cgatgactta 


ctggcaggtg 


ctgggggctt 


ccgagacaat 


cgcgaacatc 


300 


tacaccacac 


aacaccgcct 


cgaccagggt 


gagatatcgg 


ccggggacgc 


ggcggtggta 


360 


atgacaagcg 


cccagataac 


aatgggcatg 


ccttatgccg 


tgaccgacgc 


cgttctggct 


420 


cctcatatcg 


ggggggaggc 


tgggagctca 


catgccccgc 


ccccggccct 


catcctcatc 


480 


ttcgaccgcc 


atcccatcgc 


cgccctcctg 


tgctacccgg 


ccgcgcgata 


ccttatgggc 


540 


agcatgaccc 


cccaggccgt 


gctggcgttc 


gtggccctca 


tcccgccgac 


cttgcccggc 


600 


acaaacatcg 


tgttgggggc 


ccttccggag 


gacagacaca 


tcgaccgcct 


ggccaaacgc 


660 


cagcgccccg 


gcgagcggct 


tgacctggct 


atgctagccg 


cgattcgccg 


cgtttacggg 


720 


ctacttarca 

v» l y v_ v_ v_ y \_ v_ ci 


atar aataro 

c* Luv_yy i_ y \_ y 


atatrtorao 


yy*-yy<-yyy L 


<-y L yy^yyy a 


n n a t" t o o o n a 


780 


cagctttcgg 


ggacggccgt 


gccgccccag 


ggtgccgagc 


cccagagcaa 


cgcgggccca 


840 


cgaccccata 


tcggggacac 


gttatttacc 


ctgtttcggg 


cccccgagtt 


gctggccccc 


900 


aacggcgacc 


tgtataacgt 


gtttgcctgg 


gccttggacg 


tcttggccaa 


acgcctccgt 


960 


cccatgcacg 


tctttatcct 


ggattacgac 


caatcgcccg 


ccggctgccg 


ggacgccctg 


1020 


ctgcaactta 


cctccgggat 


ggtccagacc 


cacgtcacca 


ccccaggctc 


cataccgacg 


1080 


atctgcgacc 


tggcgcgcac 


gtttgcccgg 


gagatggggg 


aggctaac 




1128 



<210> 194 

<211> 1128 

<212> DNA 

<213> artificial sequence 
<220> 

<223> HSVl-tk (herpes simplex virus type-1 thymidine kinase) 
sequence-20 

<400> 194 

atggcttcgt atccctgcca tcaacacgcg tctgcgttcg accaggctgc gcgttctcgc 60 

ggccatagca accgacgtac ggcgttgcgc cctcgccggc agcaagaagc cacggaagtc 120 

cgcctggagc agaaaatgcc cacgctactg cgggtttata tagacggtcc tcacgggatg 180 

gggaaaacca ccaccacgca actgctggtg gccctgggtt cgcgcgacga tatcgtctac 240 
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gtacccgagc 


cgatgactta 


ctggcaggtg 


ctgggggctt 


ccgagacaat 


cgcgaacatc 


300 


tacaccaccc 


aacaccgcct 


cgaccagggt 


gagatatcgg 


ccggggacgc 


ggcggtggta 


360 


atgacaagcg 


cccagataac 


aatgggcatg 


ccttatgccg 


tgaccgacgc 


cgttctggct 


420 


cctcatatcg 


ggggggaggc 


tgggagctca 


catgccccgc 


ccccggccct 


caccctcatc 


480 


ttcgaccgcc 


atcccatcgc 


cgccctcctg 


tgctacccgg 


ccgcgcgata 


ccttatgggc 


540 


agcatgaccc 


cccaggccgt 


gctggcgttc 


gtggccctca 


tcccgccgac 


cttgcccggc 


600 


acaaacatcg 


tgttgggggc 


ccttccggag 


gacagacaca 


tcgaccgcct 


ggccaaacgc 


660 


cagcgccccg 


gcgagcggct 


tgacctggct 


atgctggccg 


cgattcgccg 


cgtttacggg 


720 


ctgcttgcca 


atacggtgcg 


gtatctgcag 


ggcggcgggt 


cgtggcggga 


ggattgggga 


780 


cagctttcgg 


ggacggccgt 


gccgccccag 


ggtgccgagc 


cccagagcaa 


cgcgggccca 


840 


cgaccccata 


tcggggacac 


gttatttacc 


ctgtttcggg 


cccccgagtt 


gctggccccc 


900 


aacggcgacc 


tgtataacgt 


gtttgcctgg 


gccttggacg 


tcttggccaa acgcctccgt 


960 


cccatgcacg 


tctttatcct 


ggattacgac 


caatcgcccg 


ccggctgccg 


ggacgccctg 


1020 


ctgcaactta 


cctccgggat 


ggtccagacc 


cacgtcacca 


ccccaggctc 


cataccgacg 


1080 


atctgcgacc 


tggcgcgcac 


gtttgcccgg 


gagatggggg 


aggctaac 




1128 



<210> 195 

<211> 1128 

<212> DNA 

<213> artificial sequence 
<220> 

<223> HSVl-tk (herpes simplex virus type~l thymidine kinase) 
sequence-21 



<400> 195 
atggcttcgt 


acccctgcca 


tcaacacgcg 


tctgcgttcg accaggctgc 


gcgttctcgc 


60 


ggccatagca 


accgacgtac 


ggcgttgcgc 


cctcgccggc agcaagaagc 


cacggaagtc 


120 


cgcctggagc 


agaaaatgcc 


cacgctactg 


cgggtttata tagacggtcc 


tcacgggatg 


180 


gggaaaacca 


ccaccacgca 


actgctggtg 


gccctgggtt cgcgcgacga 


tatcgtctac 


240 


gtacccgagc 


cgatgactta 


ctggcaggtg 


ctgggggctt ccgagacaat 


cgcgaacatc 


300 


tacaccacac 


aacaccgcct 


cgaccagggt 


gagatatcgg ccggggacgc 


ggcggtggta 


360 


atgacaagcg 


cccagataac 


aatgggcatg 


ccttatgccg tgaccgacgc 


cgttctggct 


420 


cctcatatcg 


ggggggaggc 


tgggagctca 


catgccccgc ccccggccct 


caccctcatc 


480 


ttcgaccgcc 


atcccatcgc 


cgccctcctg 


tgctacccgg ccgcgcgata 


ccttatgggc 


540 


agcatgaccc 


cccaggccgt 


gctggcgttc 


gtggccctca tcccgccgac 


cttgcccggc 


600 


acaaacatcg 


tgttgggggc 


ccttccggag 


gacagacaca tcgaccgcct 


ggccaaacgc 


660 
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cagcgccccg 


gcgagcggct 


tgacctggct 


atgctggccg 


cgattcgccg 


cgtttacggg 


720 


ctgcttgcca atacggtgcg 


gtatctgcag 


ggcggcgggt 


cgtggcggga ggattgggga 


780 


cagctttcgg 


ggacggccgt 


gccgccccag 


ggtgccgagc 


cccaaaacaa 


cacaaaccca 


840 


cgaccccata 


tcggggacac 


gttatttacc 


ctgtttcggg 


cccccgagtt 


gctggccccc 


900 


aacggcgacc 


tgtataacgt 


gtttgcctgg 


gccttggacg 


tcttggtcaa 


acgcctccgt 


960 


cccatgcacg 


tctttatcct 


ggattacgac 


caatcgcccg 


ccggctgccg 


ggacgccctg 


1020 


ctgcaactta 


cctccgggat 


ggtccagacc 


cacgtcacca 


ccccaggctc 


cataccgacg 


1080 


atctgcgacc tggcgcgcac gtttgcccgg 


gagatggggg 


aggctaac 




1128 


<210> 196 
<211> 1128 
<212> DNA 

<213> artificial sequence 










<220> 

<223> HSVl-tk (herpes simplex virus type-1 
sequence-22 


thymidine kinase) 




<400> 196 
atggcttcgt 


acccctgcca 


tcaacacgcg 


tctgcgttcg 


accaggctgc 


gcgttctcgc 


60 


ggccatagca 


accgacgtac 


ggcgttgcgc 


cctcgccggc 


agcaagaagc 


cacggaagtc 


120 


cgcctggagc 


agaaaatgcc 


cacgctactg 


cgggtttata 


tagacggtcc 


tcacgggatg 


180 


gggaaaacca 


ccaccacgca 


actgctggtg 


gccctgggtt 


cgcgcgacga 


tatcgtctac 


240 


gtacccgagc 


cgatgactta 


ctggcaggta 


ctgggggctt 


ccgagacaat 


cgcgaacatc 


300 


tacaccacac 


aacaccgcct 


cgaccagggt 


gagatatcgg 


ccggggacgc 


ggcggtggta 


360 


atgacaagcg 


cccagataac 


aatgggcatg 


ccttatgccg 


tgaccgacgc 


cgttctggct 


420 


cctcatatcg 


ggggggaggc 


tgggagctca 


catgccccgc 


ccccggccct 


caccctcatc 


480 


ttcgaccgcc 


atcccatcgc 


cgccctcctg 


tgctacccgg 


ccgcgcgata 


ccttatgggc 


540 


agcatgaccc 


cccaggccgt 


gctggcgttc 


gtggccctca 


tcccgccgac 


cttgcccggc 


600 


acaaacatcg 


tgttgggggc 


ccttccggag 


gacagacaca 


tcgaccgcct 


ggccaaacgc 


660 


cagcgccccg 


gcgagcggct 


tgacctggct 


atgctggccg 


cgattcgccg 


cgtttacggg 


720 


ctgcttgcca 


atacggtgcg 


gtatctgcag 


ggcggcgggt 


cgtggcggga 


ggattgggga 


780 


cagctttcgg 


ggacggccgt 


gccgccccag 


ggtgccgagc 


cccagagcaa 


cgcgggccca 


840 


cgaccccata 


tcggggacac 


gttatttacc 


ctgtttcggg 


cccccgagtt 


gctggccccc 


900 


aacggcgacc 


tgtataacgt 


gtttgcctgg 


gccttggacg 


tcttggccaa 


acgcctccgt 


960 


cccatgcacg 


tctttatcct 


ggattacgac 


caatcgcccg 


ccggctgccg 


ggacgccctg 


1020 


ctgcaactta 


cctccgggat 


ggtccagacc 


cacgtcacca 


ccccaggctc 


cataccgacg 


1080 
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atctgcgacc tggcgcgcac gtttgcccgg gagatggggg aggctaac 



1128 



<210> 197 

<211> 1128 

<212> DNA 

<213> artificial sequence 



<220> 

<223> HSVl-tk (herpes simplex virus type-1 thymidine kinase) 
sequence-23 



<400> 197 
atggcttcgt 


acccctgcca 


tcaacacgcg 


tctgcgttcg 


accaggctgc 


gcgttctcgc 


60 


ggccatagca 


accgacgtac 


ggcggtgcgc 


cctcgccggc 


agcaagaagc 


cacggaagtc 


120 


cgcctggagc 


agaaaatgcc 


cacgctactg 


cgggtttata 


tagacggtcc 


tcacgggatg 


180 


gggaaaacca 


ccaccacgca 


actgctggtg 


gccctgggtt 


cgcgcgacga tatcgtctac 


240 


gtacccgagc 


cgatgactta 


ctggcaggtg 


ctgggggctt 


ccgagacaat 


cgcgaacatc 


300 


tacaccacac 


aacaccgcct 


cgaccagggt 


gagatatcgg 


ccggggacgc ggcggtggta 


360 


atgacaagcg 


cccagataac 


aatgggcatg 


ccttatgccg 


tgaccgacgc 


cgttctggct 


420 


cctcatatcg 


ggggggaggc 


tgggagctca 


catgccccgc 


ccccggccct 


caccctcatc 


480 


ttcgaccgcc 


atcccatcgc 


cgccctcctg 


tgctacccgg 


ccgcgcgata 


ccttatgggc 


540 


agcatgaccc 


cccaggccgt 


gctggcgttc 


gtggccctca 


tcccgccgac 


cttgcccggc 


600 


acaaacatcg 


tgttgggggc 


ccttccggag 


gacagacaca 


tcgaccgcct 


ggccaaacgc 


660 


cagcgccccg 


gcgagcggct 


tgacctggct 


atgctggccg 


cgattcgccg 


cgtttacggg 


720 


ctgcttgcca 


atacggtgcg 


gtatctgcag 


ggcggcgggt 


cgtggcggga 


ggattgggga 


780 


cagctttcgg 


ggacggccgt 


gccgccccag 


ggtgccgagc 


cccagagcaa 


cgcgggccca 


840 


cgaccccata 


tcggggacac 


gttatttacc 


ctgtttcggg 


cccccgagtt 


gctggccccc 


900 


aacggcgacc 


tgtataacgt 


gtttgcctgg 


gccttggacg 


tcttggccaa acgcctccgt 


960 


cccatgcacg 


tctttatcct 


ggattacgac 


caagcgcccg 


ccggctgccg 


ggacgccctg 


1020 


ctgcaactta 


cctccgggat 


ggtccagacc 


cacgtcacca ccccaggctc 


cataccgacg 


1080 


atctgcgacc 


tggcgcgcac 


gtttgcccgg 


gagatggggg 


aggctaac 




1128 



<210> 198 

<211> 1128 

<212> DNA 

<213> arti f i ci al sequence 
<220> 

<223> HSVl-tk (herpes simplex virus type-1 thymidine kinase) 
sequence-24 

<400> 198 

atggcttcgt acccctgcca tcaacacgcg tctgcgttcg accaggctgc gcgttctcgt 60 
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ggccatagca 


accgacgtac 


ggcgttgcgc 


cctcgccggc 


agcaagaagc 


cacggaagtc 


120 


cgcctggagc 


agaaaatgcc 


cacgctactg 


cgggtttata tagacggtcc tcacgggatg 


180 


gggaaaacca 


ccaccacgca 


actgctggtg 


gccctgggtt 


cgcgcgacga tatcgtctac 


240 


gtacccgagc 


cgatgactta 


ctggcaggtg 

ZJ ZJ ~t — / ZJ 


ctgggggctt 


ccgagacaat 


cgcgaacatc 


300 


tacaccacac 


aacaccgcct 


cgaccagggt 

ZJ ZJ ^J ZJ 


gagatatcgg 


ccggggacgc 


ggcggtggta 


360 


atgacaagcg 


cccagataac 


aatgggcatg 


ccttatgccg 


tgaccgacgc 


cgttctggct 


420 


cctcatatcg 


ggggggaggc 

ZJ ZJ ZJ ZJ ZJ ZJ ZJ ZJ 


tgggagctca 


catgccccgc 


ccccggccct 


caccctcatc 


480 


ttcgaccgcc 


atcccatcgc 


cgccctcctg 


tgctacccgg 


ccgcgcgata 


ccttatgggc 


540 


agcatgaccc 


cccaggccgt 


gttggcgttc 

ZJ ZJ ZJ ZJ 


gtggccctca tcccgccgac 


cttgcccggc 


600 


acaaacatcg 


tgttgggggc 


ccttccggag 


gacagacaca 


tcgaccgcct 


ggccaaacgc 


660 


cagcgccccg 


gcgagcggct 


tgacctggct 


atgctggccg 


cgattcgccg 


cgtttacggg 


720 


ctgcttgcca 


atacggtgcg 


gtatctgcag 


ggcggcgggt 


cgtggcggga ggattgggga 


780 


cagctttcgg 


ggacggccgt 


gccgccccag 


ggtgccgagc 


cccagagcaa 


cgcgggccca 


840 


CgaCCCCdtd 


Lcggggacac 


gLLdLLLdCC 


ctgtttcggg 


cccccgagtt 


gctggccccc 


Qnn 


aacggcgacc 


tgtataacgt 


gtttgcctgg 


gccttggacg 


tcttggccaa 


acgcctccgt 


960 


cccatgcacg 


tctttatcct 


ggattacgac 


caatcgcccg 


ccggctgccg 


ggacgccctg 


1020 


ctgcaactta 


cctccgggat 


ggtccagacc 


cacgtcacca 


ccccaggctc 


cataccgacg 


1080 


atctgcgacc 


tggcgcgcac 


gtttgcccgg 


gagatggggg 


aggctaac 




1128 



<210> 199 

<211> 1128 

<212> DNA 

<213> artificial sequence 
<220> 

<223> HSVl-tk (herpes simplex virus type-1 thymidine kinase) 
sequence-25 

<400> 199 

atggcttcgt acccctgcca tcaacacgcg tctgcgttcg accaggctgc gcgttctcgc 60 

ggccatagca accgacgtac ggcgttgcgc cctcgccggc agcaagaagc cacggaagtc 120 

cgcctggagc agaaaatgcc cacgctactg cgggtttata tagacggtcc tcacgggatg 180 

gggaaaacca ccaccacgca actgctggtg gccctgggtt cgcgcgacga tatcgtctac 240 

gtacccgagc cgatgactta ctggcaggtg ctgggggctt ccgagacaat cgcgaacatc 300 

tacaccacac aacaccgcct cgaccagggt gagatatcgg ccggggacgc ggcggtggta 360 

atgacaagcg cccagataac aatgggcatg ccttatgccg tgaccgacgc cgttctggct 420 

cctcatatcg ggggggaggc tgggagctca catgccccgc ccccggccct caccctcatc 480 
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ttcgaccgcc 

ZJ ZJ 


atcccatcgc 


cgccctcctg 


tgctacccgg 


ccgcgcgata 


ccttatgggc 


540 


agcatgaccc 


cccaggccgt 


gctggcgttc 


gtggccctca 


tcccgccgac 


cttgcccggc 


600 


acaaacatcg 


tgttgggggc 

zj zj zj zj zj zj 


ccttccggag 


gacagacaca 


tcgaccgcct 


ggccaaacgc 


660 


cagcgccccg 


qcqagcqgct 

ZJ ZJ ZJ ZJZJ 


tgacctggct 


atgctggccg 


cgattcgccg 


cgtttacggg 


720 


ctgcttgcca 


atacggtgcg 


gtatctgcag ggcggcgggt 


cgtggcggga 

ZJ ZJ ZJ ZJ ZJ ZJ 


ggattgggga 

ZJ ZJ ZJ ZJ ZJ ZJ 


780 


cagctttcgg 


ggacggccgt 


gccgccccag 


ggtgccgagc 


cccagagcaa 


cgcgggccca 

ZJ ZJ ZJ ZJ 


840 


cgaccccaca 


tcggggacac 


gttatttacc 


ctgtttcggg 


cccccgagtt 


gctggccccc 


900 


aacggcgacc 


tgtataacgt 


gtttgcctgg 


gccttggacg 


tcttggccaa 


acgcctccgt 


960 


cccatgcacg 


tctttatcct 


ggattacgac 


caatcgcccg 


ccggctgccg 


ggacgccctg 


1020 


ctgcaactta 


cctccgggat 


ggtccagacc 


cacgtcacca 


ccccaaactc 


cataccaaca 


1080 


atctgcgacc 


tggcgcgcac 


gtttgcccgg 


gagatggggg 


aggctaac 




1128 


<210> 200 
<211> 1128 
<212> DNA 

<213> artificial sequence 










<220> 

<223> HSVl-tk (herpes simplex virus type-1 
sequence-26 


thymidine kinase) 




<400> 200 
atggcttcgt 


acccctgcca 


tcaacacgcg 


tctgcgttcg 


accaggctgc 


gcgttctcgc 


60 


ggccatagca 


accgacgtac 


ggcgttgcgc 


cctcgccggc 


agcaagaagc 


cacggaagtc 


120 


cgcctggagc 


agaaaatgcc 


cacgctactg 


cgggtttata 


tagacggtcc 


tcacgggatg 


180 


gggaaaacca 


ccaccacgca 


actgctggtg 


gccctgggtt 


cgcgcgacga 


tatcgtctac 


240 


gtacccgagc 


cgatgactta 


ctggcgggtg 


ctgggggctt 


ccgagacaat 


cgcgaacatc 


300 


tacaccacac 


aacaccgcct 


cgaccagggt 


gagatatcgg 


ccggggacgc 


ggcggtggta 


360 


atgacaagcg 


cccagataac 


aatgggcatg 


ccttatgccg 


tgaccgacgc 


cgttctggct 


420 


cctcatatcg 


ggggggaggc 


tgggagctca 


catgccccgc 


ccccggccct 


cgccctcatc 


480 


ttcgaccgcc 


atcccatcgc 


cgccctcctg 


tgctacccgg 


ccgcgcgata 


ccttatgggc 


540 


agcatgaccc 


cccaggccgt 


gctggcgttc 


gtggccctca 


tcccgccgac 


cttgcccggc 


600 


acaaacatcg 


tgttgggggc 


ccttccggag 


gacagacaca 


tcgaccgcct 


ggccaaacgc 


660 


cagcgccccg 


gcgagcggct 


tgacctggct 


atgctggccg 


cgattcgccg 


cgtttacggg 


720 


ctgcttgcca 


atacggtgcg 


gtatctgcag 


ggcggcgggt 


cgtggcggga 


ggattgggga 


780 


cagctttcgg 


ggacggccgt 


gccgccccag 


ggtgccgagc 


cccagagcaa 


cgcgggccca 


840 


cgaccccata 


tcggggacac 


gttatttacc 


ctgtttcggg cccccgagtt 
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900 
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aacggcgacc tgtacaacgt 


gtttgcctgg gccttggacg 


tcttggccaa acgcctccgt 


960 


cccatgcacg 


tctttatcct 


ggattacgac 


caatcgcccg 


ccggctgccg 


ggacgccctg 


1020 


ctgcaactta 


cctccgggat 


gatccagacc 


cacgtcacca 


ccccaggctc 


cataccgacg 


1080 


atctgcgacc 


tggcgcgcac 


gtttgcccgg 


gagatggggg 


aggctaac 




1128 


<210> 201 
<211> 1128 
<212> DNA 

<213> artificial sequence 










<220> 

<223> HSVl-tk (herpes simplex virus type-1 
sequence-27 


thymidine kinase) 




<4UU> ^Ul 

atggcttcgt 


accccggcca 


tcagcacgcg 


tctgcgttcg 


accaggctgc 


gcgttctcgc 


60 


ggccatagca 


accgacgtac 


ggcgttgcgc 


cctcgccggc 


agcaagaagc 


cacggaagtc 


120 


cgcccggagc 


agaaaatgcc 


cacgctactg 


cgggtttata 


tagacggtcc 


ccacgggatg 


180 


gggaaaacca 


ccaccacgca 


actgctggtg 


gccctgggtt 


cgcgcgacga 


tatcgtctac 


240 


gtacccgagc 


cgatgactta 


ctggcgggtg 


ctgggggctt 


ccgagacaat 


cgcgaacatc 


300 


tacaccacac 


aacaccgcct 


cgaccagggt 


gagatatcgg 


ccggggacgc 


ggcggtggta 


360 


atgacaagcg 


cccagataac 


aatgggcatg 


ccttatgccg 


tgaccgacgc 


cgttctggct 


420 


cctcatatcg 


ggggggaggc 


tgggagctca 


catgccccgc 


ccccggccct 


caccctcatc 


480 


ttcgaccgcc 


atcccatcgc 


cgccctcctg 


tgctacccgg 


ccgcgcgata 


ccttatgggc 


540 


agcatgaccc 


cccaggccgt 


gctggcgttc 


gtggccctca 


tcccgccgac 


cttgcccggc 


600 


acaaacatcg 


tgttgggggc 


ccttccggag 


gacagacaca 


tcgaccgcct 


ggccaaacgc 


660 


cagcgccccg 


gcgagcggct 


tgacctggct 


atgctggccg 


cgattcgccg 


cgtttacggg 


720 


ctgcttgcca 


atacggtgcg 


gtatctgcag 


ggcggcgggt 


cgtggcggga 


ggattgggga 


780 


cagctttcgg 


ggacggccgt 


gccgccccag 


ggtgccgagc 


cccagagcaa 


cgcgggccca 


840 


cgaccccaca 


tcggggacac 


gttatttacc 


ctgtttcggg 


cccccgagtt 


gctggccccc 


yuu 


aacggcgacc 


tgtataacgt 


gtttgcctgg 


gccttggacg 


tcttggccaa 


acgcctccgt 


960 


cccatgcacg 


tctttatcct 


ggattacgac 


caatcgcccg 


ccggctgccg 


ggacgccctg 


1020 


ctgcaactta 


cctccgggat 


ggtccagacc 


cacgtcacca 


ccccaggctc 


cataccgacg 


1080 


atctgcgacc 


tggcgcgcac 


gtttgcccgg 


gagatggggg 


aggctaac 




1128 



<210> 202 
<211> 1128 
<?.12> DNA 

<213> artificial sequence 
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<220> 

<223> HSVl-tk (herpes simplex virus type-1 thymidine kinase) 
sequence-28 



<400> 202 
atggcttcgt 


accccggcca 


tcagcacgcg 


tctgcgttcg 


accaggctgc 


gcgttctcgc 


60 


ggccatagca 


accgacgtac 


ggcgttgcgc 


cctcgccggc 


agcaagaagc 


cacggaagtc 


120 


cgcccggagc 


agaaaatgcc 


cacgctactg 


cgggtttata 


tagacggtcc 


ccacgggatg 


180 


gggaaaacca 


ccaccacgca 


actgctggtg 


gccctgggtt 


cgcgcgacga 


tatcgtctac 


240 


gtacccgagc 


cgatgactta 


ctggcgggtg 


ctgggggctt 


ccgagacaat 


cgcgaacatc 


300 


tacaccacac 


aacaccgcct 


cgaccagggt 


gagatatcgg 


ccggggacgc 


ggcggtggta 


360 


atgacaagcg 


cccagataac 


aatgggcatg 


ccttatgccg 


tgaccgacgc 


cgttctggct 


420 


cctcatatcg 


ggggggaggc 


tgggagctca catgccccgc 


ccccggccct 


caccctcatc 


480 


ttcgaccgcc 


atcccatcgc 


cgccctcctg 


tgctacccgg 


ccgcgcgata 


ccttatgggc 


540 


agcatgaccc 


cccaggccgt 


gttggcgttc 


gtggccctca 


tcccgccgac 


cttgcccggc 


600 


acaaacatcg 


tgttgggggc 


ccttccggag 


gacagacaca 


tcgaccgcct 


ggccaaacgc 


660 


cagcgccccg 


gcgagcggct 


tgacctggct 


atgctggccg 


cgattcgccg 


cgtttacggg 


720 


K, Ly V_ L Ly l_ \~CL 


ciucn-yy Lyv„y 


gtatctgcag ggcggcgggt 


*-y *-yy*-yyy a 


aaattaaaaa 


780 


cagctttcgg 


ggacggccgt 


gccgccccag 


ggtgccgagc 


cccagagcaa 


cgcgggccca 


840 


cgaccccata 


tcggggacac 


gttatttacc 


ctgtttcggg 


cccccgagtt 


gctggccccc 


900 


aacggcgacc 


tgtataacgt 


gtttgcctgg 


gccttggacg 


tcttggccaa 


acgcctccgt 


960 


cccatgcacg 


tctttatcct 


ggattacgac 


caatcgcccg 


ccggctgccg 


ggacgccctg 


1020 


ctgcaactta 


cctccgggat 


ggtccagacc 


cacgtcacca 


ccccaggctc 


cataccgacg 


1080 


atctgcgacc 


tggcgcgcac 


gtttgcccgg 


gagatggggg 


aggctaac 




1128 



<210> 203 

<211> 1128 

<212> DNA 

<213> artificial sequence 
<220> 

<223> HSVl-tk (herpes simplex virus type-1 thymidine kinase) 
sequence-29 

<400> 203 

atggcttcgt accccggcca tcagcacgcg tctgcgttcg accaggctgc gcgttctcgc 60 

ggccatagca accgacgtac ggcgttgcgc cctcgccggc agcaagaagc cacggaagtc 120 

cgcccggagc agaaaatgcc cacgctactg cgggtttata tagacggtcc ccacgggatg 180 

gggaaaacca ccaccacgca actgctggtg gccctgggtt cgcgcgacga tatcgtctac 240 
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gtacccgagc 


cgatgactta 
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ctggcgggtg ctgggggctt ccgagacaat 


cgcgaacatc 


300 


tacaccacac 


aacaccgcct 


cgaccagggt 


gagatatcgg 


ccggggacgc 


ggcggtggta 


360 


atgacaagcg 


cccagataac 


aatgggcatg 


ccttatgccg 


tgaccgacgc 


cgttctggct 


420 


cctcatatcg 


ggggggaggc 


tgggagctca 


catgccccgc 


ccccggccct 


caccctcatc 


480 


ttcgaccgcc 


atcccatcgc 


cgccctcctg 


tgctacccgg 


ccgcgcgata 


ccttatgggc 


540 


agcatgaccc 


cccaggccgt 


gctggcgttc 


gtggccctca 


tcccgccgac 


cttgcccggc 


600 


acaaacatcg 


tgttgggggc 


ccttccggag 


gacagacaca 


tcgaccgcct 


ggccaaacgc 


660 


cagcgccccg 


gcgagcggct 


tgacctggct 


atgctggccg 


cgattcgccg 


cgtttacggg 


720 


ctgcttgcca 


atacggtgcg 


gtatctgcag 


ggcggcgggt 


cgtggcggga 


ggattgggga 


780 


cagctttcgg 


ggacggccgt 


gccgccccag 


ggtgccgagc 


cccagagcaa 


cgcgggccca 


840 


cgaccccaca 


tcggggacac 


gttatttacc 


ctgtttcggg 


cccccgagtt 


gctggccccc 


900 




1" n 1" 3 1~ 7k a c n 1" 


gtttgcctgg 


gccttggacg 


tcttggccaa 




Qfift 


cccatgcacg 


tctttatcct 


ggattacgac 


caatcgcccg 


ccggctgccg 


ggacgccctg 


1020 


ctgcaactta 


cctccgggat 


ggtccagacc 


cacgtcacca 


cccaaggctc 


cataccgacg 


1080 


atctgcgacc 


tggcgcgcac 


gtttgcccgg 


gagatggggg 


aggctaac 




1128 



<210> 204 

<211> 1128 

<212> DNA 

<213> artificial sequence 
<220> 

<223> HSVl-tk (herpes simplex virus type-1 thymidine kinase) 
sequence-30 



<400> 204 
atggcttcgt 


accccggcca 


tcagcacgcg 


tctgcgttcg 


accaggctgc 


gcgttctcgc 


60 


ggccatagca 


accgacgtac 


ggcgttgcgc 


cctcgccggc 


agcaagaagc 


cacggaagtc 


120 


cgcccggagc 


agaaaatgcc 


cacgctactg 


cgggtttata 


tagacggccc 


ccacgggatg 


180 


gggaaaacca 


ccaccacgca 


actgctggtg 


gccctgggtt 


cgcgcgacga 


tatcgtctac 


240 


gtacccgagc 


cgatgactta 


ctggcgggtg 


ctgggggctt 


ccgagacaat 


cgcgaacatc 


300 


tacaccacac 


aacaccgcct 


cgaccagggt 


gagatatcgg 


ccggggacgc 


ggcggtggta 


360 


atgacaagcg 


cccagataac 


aatgggcatg 


ccttatgccg 


tgaccgacgc 


cgttctggct 


420 


cctcatatcg 


ggggggaggc 


tgggagctca 


catgccccgc 


ccccggccct 


caccctcatc 


480 


ttcgaccgcc 


atcccatcgc 


cgccctcctg 


tgctacccgg 


ccgcgcgata 


ccttatgggc 


540 


agcatgaccc 


cccaggccgt 


gctggcgttc 


gtggccctca 


tcccgccgac 


cttgcccggc 


600 


acaaacatcg 


tgttgggggc 


ccttccggag 


gacagacaca 


tcgaccgcct 


ggccaaacgc 


660 
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cagcgccccg 


gcgagcggct 


tgacctggct 


atgctggccg 


cgattcgccg 


cgtttacggg 


720 


ctgcttgcca atacggtgcg gtatctgcag ggcggcgggt 


cgtggcggga 


ggattgggga 


780 


cagctttcgg 


ggacggccgt 


gccgccccag 


ggtgccgagc 


cccagagcaa 


cgcgggccca 


840 


cgaccccaca 


tcggggacac 


gttatttacc 


ctgtttcggg 


cccccgagtt 


gctggccccc 


900 


aacggcgacc 


tgtataacgt 


gtttgcctgg 


gccttggacg 


tcttggccaa 


acgcctccgt 


you 


cccatgcacg 


tctttatcct 


ggattacgac 


caatcgcccg 


ccggctgccg 


ggacgccctg 


1020 


ctgcaactta 


cctccgggat 


ggtccagacc 


cacgtcacca 


cccaaggctc 


cataccgacg 


1080 


atctgcgacc tggcgcgcac 


gtttgcccgg 


gagatggggg 


aggctaac 




1128 


<210> 205 
<211> 1128 
<212> DNA 

<213> artificial sequence 










<220> 

<223> HSVl-tk (herpes simplex virus type-1 
sequence-31 


thymidine kinase) 




<400> 205 
atggcttcgt 


accccggcca 


tcagcacgcg 


tctgcgttcg 


accaggctgc 


gcgttctcgc 


60 


ggccatagca 


accgacgtac 


ggcgttgcgc 


cctcgccggc 


agcaagaagc 


cacggaagtc 


120 


cgcccggagc 


agaaaatgcc 


cacgctactg 


cgggtttata 


tagacggtcc 


ccacgggatg 


180 


gggaaaacca 


ccaccacgca 


actgctggtg 


gccctgggtt 


cgcgcgacga 


tatcgtctac 


240 


gtacccgagc 


cgatgactta 


ctggcgggtg 


ctgggggctt 


ccgagacaat 


cgcgaacatc 


300 


tacaccacac 


aacaccgcct 


cgaccagggt 


gagatatcgg 


ccggggacgc 


ggcggtggta 


360 


atgacaagcg 


cccagataac 


aatgggcatg 


ccttatgccg 


tgaccgacgc 


cgttctggct 


420 


cctcatatcg 


ggggggaggc 


tgggagctca 


catgccccgc 


ccccggccct 


caccctcatc 


480 


ttcgaccgcc 


atcccatcgc 


cgccctcctg 


tgctacccgg 


ccgcgcgata 


ccttatgggc 


540 


agcatgaccc 


cccaggccgt 


gctggcgttc 


gtggccctca 


tcccgccgac 


cttgcccggc 


600 


acaaacatcg 


tgttgggggc 


ccttccggag 


gacagacaca 


tcgaccgcct 


ggccaaacgc 


660 


cagcgccccg 


gcgagcggct 


tgacctggct 


atgctggccg 


cgattcgccg 


cgtttacggg 


720 


ctgcttgcca 


atacggtgcg 


gtatctgcag 


gccggcgggt 


cgtggcggca 


ggattgggga 


780 


cagctttcgg 


ggacggccgt 


gccgccccag 


ggtgccgagc 


cccagagcaa 


cgcgggccca 


840 


cgaccccaca 


tcggggacac 


gttatttacc 


ctgtttcggg 


cccccgagtt 


gctggccccc 


900 


aacggcgacc 


tgtataacgt 


gtttgcctgg 


gccttggacg 


tcttggccaa 


acgcctccgt 


960 


cccatgcacg 


tctttatcct 


ggattacgac 


caatcgcccg 


ccggctgccg 


ggacgccctg 


1020 


ctgcaactta 


cctccgggat 


ggtccagacc 


cacgtcacca 


ccccaggctc 


cataccgacg 


1080 
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atctgcgacc tggcgcgcac gtttgcccgg gagatggggg aggctaac 



1128 



<210> 206 

<211> 1128 

<212> DNA 

<213> artificial sequence 
<220> 

<223> HSVl-tk (herpes simplex virus type-1 thymidine kinase) 
sequence-32 



<400> 206 
atggcttcgt 


acccctgcca 


tcaacacgcg 


tctgcgttcg 


accaggctgc 


gcgttctcgc 


60 


ggccatagca 


accgacgtac 


ggcgttgcgc 


cctcgccggc 


agcaagaagc 


cacggaagtc 


120 


cgcctggagc 


agaaaatgcc 


cacgctactg 


cgggtttata 


tagacggtcc 


tcacgggatg 


180 


gggaaaacca 


ccaccacgca 


actgctggtg 


gccctgggtt 


cgcgcgacga 


tatcgtctac 


240 


gtacccgagc 


cgatgactta 


ctggcaggtg 


ctgggggctt 


ccgagacaat 


cgcgaacatc 


300 


tacaccacac 


aacaccgcct 


cgaccagggt 


gagatatcgg 


ccggggacgc ggcggtggta 


360 


atgacaagcg 


cccagataac 


aatgggcatg 


ccttatgccg 


tgaccgacgc 


cgttctggct 


420 


cctcatatcg 


ggggggaggc 


tgggagctca 


catgccccgc 


ccccggccct 


caccctcatc 


480 


ttcgaccgcc 


atcccatcgc 


cgccctcctg 


tgctacccgg 


ccgcgcgata 


ccttatgggc 


540 


agcatgaccc 


cccaggccgt 


gctggcgttc 


gtggccctca 


tcccgccgac 


cttgcccggc 


600 


acaaacatcg 


tgttgggggc 


ccttccggag 


gacagacaca 


tcgaccgcct 


ggccaaacgc 


660 


cagcgccccg 


gcgagcggct 


tgacctggct 


atgctggccg 


cgattcgccg 


cgtttacggg 


720 


ctgcttgcca 


atacggtgcg 


gtatctgcag 


ggcggcgggt 


cgtggcggga ggattgggga 


780 


cagctttcgg 


ggacggccgt 


gccgccccag 


ggtgccgagc 


cccagagcaa 


cgcgggccca 


840 


cgaccccata 


tcggggacac 


gttatttacc 


ctgtttcggg 


cccccgagtt 


gctggccccc 


900 


aacggcgacc 


tgtataacgt 


gtttgcctgg 


gccttggacg 


tcttggccaa acgcctccgt 


960 


cccatgcacg 


tctttatcct 


ggattacgac 


caatcgcccg 


ccggctgccg ggacgccctg 


1020 


ctgcaactta 


cctccgggat 


ggtccagacc 


cacgtcacca 


ccccaggctc 


cataccgacg 


1080 


atctgcgacc 


tggcgcgcac 


gtttgcccgg 


gagatggggg 


aggctaac 




1128 



<210> 207 

<211> 1131 

<212> DNA 

<213> artificial sequence 
<220> 

<223> HSVl-tk (herpes simplex virus type-1 thymidine kinase) 
sequence-33 

<400> 207 

atggcttcgt acccctgcca tcaacacgcg tctgcgttcg accaggctgc gcgttctcgc 60 
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ggccatagca 


accgacgtac 


ggcgttgcgc 


cctcgccggc 


agcaagaagc 


cacggaagtc 


120 


cgcctggagc 


agaaaatgcc 


cacgctactg 


cgggtttata 


tagacggtcc 


tcacgggatg 


180 


gggaaaacca 


ccaccacgca 


actgctggtg 


gccctgggtt 


cgcgcgacga 


tatcgtctac 


240 


gtacccgagc 


cgatgactta 


ctggcaggtg 


ctgggggctt 


ccgagacaat 


cgcgaacatc 


300 


tacaccacac 


aacaccgcct 


cgaccagggt 


gagatatcgg 


ccggggacgc 


ggcggtggta 


360 


atgacaagcg 


cccagataac 


aatgggcatg 


ccttatgccg 


tgaccgacgc 


cgttctggct 


420 


cctcatatcg 


ggggggaggc 


tgggagctca 


catgccccgc 


ccccggccct 


caccctcatc 


480 


ttcgaccgcc 


atcccatcgc 


cgccctcctg 


tgctacccgg 


ccgcgcgata 


ccttatgggc 


540 


agcatgaccc 


cccaggccgt 


gctggcgttc 


gtggccctca 


tcccgccgac 


cttgcccggc 


600 


acaaacatcg 


tgttgggggc 


ccttccggag 


gacagacaca 


tcgaccgcct 


ggccaaacgc 


660 


cagcgccccg 


gcgagcggct 


tgacctggct 


atgctggccg 


cgattcgccg 


cgtttacggg 


720 


ctgcttgcca 


atacggtgcg 


gtatctgcag 


ggcggcgggt 


cgtggcggga 


ggattgggga 


780 


cagctttcgg 


ggacggccgt 


gccgccccag 


ggtgccgagc 


cccagagcaa 


cgcgggccca 


840 


cgaccccata 


tcggggacac 


gttatttacc 


ctgtttcggg 


cccccgagtt 


gctggccccc 


900 


aacggcgacc 


tgtataacgt 


gtttgcctgg 


gccttggacg 


tcttggccaa 


acgcctccgt 


960 


cccatgcacg 


tctttatcct 


ggattacgac 


caatcgcccg 


ccggctgccg 


ggacgccctg 


1020 


ctgcaactta 


cctccgggat 


ggtccagacc 


cacgtcacca 


cccaggctcc 


ataccgacga 


1080 


tctgcgacct 


ggcgcgcacg 


tttgcccggg 


agatggggga 


ggctaactga 


a 


1131 


<210> 208 
<211> 1131 
<212> DNA 

<213> artificial sequence 










<220> 

<223> HSVl-tk (herpes simplex virus type-1 
sequence-34 


thymidine kinase) 




<400> 208 
atggcttcgt 


acccctgcca tcaacacgcg tctgcgttcg 


accaggctgc 


gcgttctcgc 


60 


ggccatagca accgacgtac 


ggcgttgcgc 


cctcgccggc 


agcaagaagc 


cacggaagtc 


120 


cgcctggagc 


agaaaatgcc 


cacgctactg 


cgggtttata 


tagacggtcc 


tcacgggatg 


180 


gggaaaacca 


ccaccacgca actgctggtg gccctgggtt 


cgcgcgacga tatcgtctac 




gtacccgagc 


cgatgactta 


ctggcaggtg 


ctgggggctt 


ccgagacaat 


cgcgaacatc 


300 


tacaccacac 


aacaccgcct 


cgaccagggt 


gagatatcgg 


ccggggacgc 


ggcggtggta 


360 


atgacaagcg 


cccagataac 


aatgggcatg 


ccttatgccg 


tgaccgacgc 


cgttctggct 


420 


cctcatatcg 


ggggggaggc 


tgggagctca 


catgccccgc 


ccccggccct 


caccctcatc 


480 
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ttcgaccgcc 


atcccatcgc 


cgccctcctg tgctacccgg 


ccgcgcgata 


ccttatgggc 


540 


agcatgaccc 


ccaggccgtg 


ctggcgttcg tggccctcat 


cccgccgacc 


ttgcccggca 


600 


caaacatcgt 


gttgggggcc 


cttccggagg 


acagacacat 


cgaccgcctg 


gccaaacgcc 


660 


agcgccccgg 


cgagcggctt 


gacctggcta tgctggccgc 


gattcgccgc 


gtttacgggc 


720 


tgcttgccaa 


tacggtgcgg 


tatctgcagg 


gcggcgggtc 


gtggcgggag 

ZJ ZJ ZJ ZJ ZJ ZJ ZJ 


gattggggac 


780 


agctttcggg 


gacggccgtg 


ccgccccagg 


gtgccgagcc 


ccagagcaac 


gcgggcccac 

Zf ZJ ZJ Zt 


840 


gaccccatat 


cggggacacg 


ttatttaccc 


tgtttcgggc 


ccccgagttg 


ctggccccca 


900 


acggcgacct 


gtataacgtg 


tttgcctggg 


ccttggacgt 


cttggccaaa 


cgcctccgtc 


960 


ccatgcacgt 


ctttatcctg 


gattacgacc 


aatcgcccgc 


cggctgccgg 


gacgccctgc 


1020 


tgcaacttac 


ctccgggatg 


gtccagaccc 


acgtcaccac 


cccaggctcc 


ataccgacga 


1080 


tctgcgacct 


ggcgcgcacg 


tttgcccggg 


agatggggga 


ggctaactga 


a 


1131 


<210> 209 
<211> 1128 
<212> DNA 

<213> artificial sequence 










<220> 

<223> HSVl-tk (herpes simplex virus type-1 
sequence-35 


thymidine kinase) 




<400> 209 
atggcttcgt 


acccctgcca 


tcaacacgcg 


tctgcgttcg 


accaggctgc 


gcgttctcgc 


60 


ggccatagca 


accgacgtac 


ggcgttgcgc 


cctcgccggc 


agcaagaagc 


cacggaagtc 


120 


cgcctggagc 


agaaaatgcc 


cacgctactg 


cgggtttata 


taaacggtcc 


tcacgggatg 


180 


gggaaaacca 


ccaccacgca 


actgctggtg 


gccctgggtt 


cgcgcgacga 


tatcgtctac 


240 


gtacccgagc 


cgatgactta 


ctggcaggtg 


ctgggggctt 


ccgagacaat 


cgcgaacatc 


300 


tacaccacac 


aacaccgcct 


cgaccagggt 


gagatatcgg 


ccggggacgc 


ggcggtggta 


360 


atgacaagcg 


cccagataac 


aatgggcatg 


ccttatgccg 


tgaccgacgc 


cgttctggct 


420 


cctcatatcg 


ggggggaggc 


tgggagctca 


catgccccgc 


ccccggccct 


caccctcatc 


480 


ttcgaccgcc 


atcccatcgc 


cgccctcctg 


tgctacccgg 


ccgcgcgata 


ccttatgggc 


540 


agcatgaccc 


cccaggccgt 


gctggcgttc 


gtggccctca 


tcccgccgac 


cttgcccggc 


600 


acaaacatcg 


tgttgggggc 


ccttccggag 


gacagacaca 


tcgaccgcct 


ggccaaacgc 


660 


cagcgccccg 


gcgagcggct 


tgacctggct 


atgctggccg 


cgattcgccg 


cgtttacggg 


720 


ctgcttgcca 


atacggtgcg 


gtatctgcag 


ggcggcgggt 


cgtggcggga 


ggattgggga 


780 


cagctttcgg 


ggacggccgt 


gccgccccag 


ggtgccgagc 


cccagagcaa 


cgcgggccca 


840 


cgaccccata 


tcggggacac 


gttatttacc 


ctgtttcggg cccccgagtt 
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gctggccccc 


900 
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aacggcgacc tgtataacgt gtttgcctgg gccttggacg tcttggccaa acgcctccgt 960 

cccatgcacg tctttatcct ggattacgac caatcgcccg ccggctgccg ggacgccctg 1020 

ctgcaactta cctccgggat ggtccagacc cacgtcacca ccccaggctc cataccgacg 1080 

atctgcgacc tggcgcgcac gtttgcccgg gagatggggg aggctaac 1128 

<210> 210 

<211> 1128 

<212> DNA 

<213> artificial sequence 
<220> 

<223> HSVl-tk (herpes simplex virus type-1 thymidine kinase) 
sequence-36 



<400> 210 
atggcttcgt 


acccctgcca 


tcaacacgcg 


tctgcgttcg 


accaggctgc 


gcgttctcgc 


60 


ggccatagca 


accgacgtac 


ggcgttgcgc 


cctcgccggc 


agcaagaagc 


cacggaagtc 


120 


cgcctggagc 


agaaaatgcc 


cacgctactg 


cgggtttata 


tagacggtcc 


tcacgggatg 


180 


gggaaaacca 


ccaacacgca 


actgctggtg 


gccctgggtt 


cgcgcgacga 


tatcgtctac 


240 


qtacccgagc 


cgatgactta 


ctggcaggtg 


ctgggggctt 


ccgagacaat 


cgcgaacatc 


300 


i.acaccacac 


aacaccgccr 


cgaccaggg l 


gaga ra tcgg 


ccggggacgc 


ggcggtggta 




atgacaagcg 


cccagataac 


aatgggcatg 


ccttatgccg 


tgaccgacgc 


cgttctggct 


420 


cctcatatcg 


ggggggaggc 


tgggagctca 


catgccccgc 


ccccggccct 


caccctcatc 


480 


ttcgaccgcc 


atcccatcgc 


cgccctcctg 


tgctacccgg 


ccgcgcgata 


ccttatgggc 


540 


cigcatgaccc 


cccaggccgt 


gctggcgttc 


gtggccctca 


tcccgccgac 


cttgcccggc 


600 


acaaacatcg 


tgttgggggc 


ccttccggag 


gacagacaca 


tcgaccgcct 


ggccaaacgc 


660 


cagcgccccg 


gcgagcggct 


tgacctggct 


atgctggccg 


cgattcgccg 


cgtttacggg 


720 


ctgcttgcca 


atacggtgcg 


gtatctgcag 


ggcggcgggt 


cgtggcggga 


ggattgggga 


780 


cagctttcgg 


ggacggccgt 


gccgccccag 


ggtgccgagc 


cccagagcaa 


cgcgggccca 


840 


cgaccccata 


tcggggacac 


gttatttacc 


ctgtttcggg 


cccccgagtt 


gctggccccc 


900 


aacggcgacc 


tgtataacgt 


gtttgcctgg 


gccttggacg 


tcttggccaa 


acgcctccgt 


960 


cccatgcacg 


tctttatcct 


ggattacgac 


caatcgcccg 


ccggctgccg 


ggacgccctg 


1020 


ctgcaactta 


cctccgggat 


ggtccagacc 


cacgtcacca 


ccccaggctc 


cataccgacg 


1080 


atctgcgacc 


tggcgcgcac 


gtttgcccgg 


gagatggggg 


aggctaac 




1128 



<210> 211 
<211> 1128 
<212> DNA 

<213> artificial sequence 
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<220> 

<223> HSVl-tk (herpes simplex virus type-1 thymidine kinase) 
sequence-37 



<400> 211 
atggcttcgt 


acccctgcca 


tcaacacgcg 


tctgcgttcg 


accaggctgc 


gcgttctcgc 


60 


ggccatagca 


accgacgtac 


ggcgttgcgc 


cctcgccggc 


agcaagaagc 


cacggaagtc 


120 


cgcctggagc 


agaaaatgcc 


cacgctactg 


cgggtttata 


tagacggtcc 


tcacgggatg 


180 


gggaaaacca 


ccaccacgca 


actgctggtg 


gccctgggtt 


cgcgcgacga 


tatcgtctac 


240 


gtacccgagt 


cgatgactta 


ctggcaggtg 


ctgggggctt 


ccgagacaat 


cgcgaacatc 


300 


tacaccacac 


aacaccgcct 


cgaccagggt 


gagatatcgg 


ccggggacgc 


ggcggtggta 


360 


atgacaagcg 


cccagataac 


aatgggcatg 


ccttatgccg 


tgaccgacgc 


cgttctggct 


420 


cctcatatcg 


ggggggaggc 


tgggagctca 


catgccccgc 


ccccggccct 


caccctcatc 


480 


ttcgaccgcc 


atcccatcgc 


cgccctcctg 


tgctacccgg 


ccgcgcgata 


ccttatgggc 


540 


agcatgaccc 


cccaggccgt 


gctggcgttc 


gtggccctca 


tcccgccgac 


cttgcccggc 


600 


acaaacatcg 


tgttgggggc 


ccttccggag 


gacagacaca 


tcgaccgcct 


ggccaaacgc 


660 


cagcgccccg 


gcgagcggct 


tgacctggct 


atgctggccg 


cgattcgccg 


cgtttacggg 


720 




dtdLyytycy 


yLdLCLycdy 


ggcggcgggt 


cgtggcggga 




/ OU 


cagctttcgg 


ggacggccgt 


gccgccccag 


ggtgccgagc 


cccagagcaa 


cgcgggccca 


840 


cgaccccata 


tcggggacac 


gttatttacc 


ctgtttcggg 


cccccgagtt 


gctggccccc 


900 


aacggcgacc 


tgtataacgt 


gtttgcctgg 


gccttggacg 


tcttggccaa 


acgcctccgt 


960 


cccatgcacg 


tctttatcct 


ggattacgac 


caatcgcccg 


ccggctgccg 


ggacgccctg 


1020 


ctgcaactta 


cctccgggat 


ggtccagacc 


cacgtcacca 


ccccaggctc 


cataccgacg 


1080 


atctgcgacc 


tggcgcgcac 


gtttgcccgg 


gagatggggg 


aggctaac 




1128 



<210> 212 

<211> 1128 

<212> DNA 

<213> artificial sequence 
<220> 

<223> HSVl-tk (herpes simplex virus type-1 thymidine kinase) 
sequence-38 

<400> 212 

atggcttcgt acccctgcca tcaacacgcg tctgcgttcg accaggctgc gcgttctcgc 60 

ggccatagca accgacgtac ggcgttgcgc cctcgccggc agcaagaagc cacggaagtc 120 

cgcctggagc agaaaatgcc cacgctactg cgggtttata tagacggtcc tcacgggatg 180 

gggaaaacca ccaccacgca actgctggtg gccctgggtt cgcgcgacga tatcgtctac 240 
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gtacccgagc 


cgatgactta 


sequence "listing_ST25.txt 
ctggcaggtg ctgggggctt ccgagacaat 


cgcgaacatc 


300 


tacaccacac 


aacaccgcct 


cgaccagggt 


gagatatcgg 


ccggggacgc 


ggcggtggta 


360 


atgacaagcg 


cccagataac 


aatgggcatg 


ccttatgccg 


tgaccgacgc 


cgttctggct 


420 


cctcatatcg 


ggggggaggc 


tgggagctca 


catgccccgc 


ccccggccct 


caccctcatc 


480 


ttcgaccgcc 


atcccatcgc 


cgccctcctg 


tgctaccggg 


ccgcgcgata 


ccttatgggc 


540 


agcatgaccc 


cccaggccgt 


gctggcgttc 


gtggccctca 


tcccgccgac 


cttgcccggc 


600 


acaaacatcg 


tgttgggggc 


ccttccggag 


gacagacaca 


tcgaccgcct 


ggccaaacgc 


660 


cagcgccccg 


gcgagcggct 


tgacctggct 


atgctggccg 


cgattcgccg 


cgtttacggg 


720 


ctgcttgcca 


atacggtgcg 


gtatctgcag 


ggcggcgggt 


cgtggcggga 


ggattgggga 


780 


cagctttcgg 


ggacggccgt 


gccgccccag 


ggtgccgagc 


cccagagcaa 


cgcgggccca 


840 


cgaccccata 


tcggggacac 


gttatttacc 


ctgtttcggg 


cccccgagtt 


gctggccccc 


900 


aacaacaacc 

\A V» \* \A \mm KJk ^ ^ 


tatataacat 


gtttgcctgg 


gccttggacg 


tcttggccaa 


aCQCCtCCQt 
v, y v_ \_ i_ v_ v_ y i_ 


960 


cccatgcacg 


tctttatcct 


ggattacgac 


caatcgcccg 


ccggctgccg 


ggacgccctg 


1020 


ctgcaactta 


cctccgggat 


ggtccagacc 


cacgtcacca 


ccccaggctc 


cataccgacg 


1080 


atctgcgacc 


tggcgcgcac 


gtttgcccgg 


gagatggggg 


aggctaac 




1128 



<210> 213 

<211> 1128 

<212> DNA 

<213> artificial sequence 
<220> 

<223> HSVl-tk (herpes simplex virus type-1 thymidine kinase) 
sequence-39 



<400> 213 
atggcttcgt 


acccctgcca 


tcaacacgcg 


tctgcgttcg 


accaggctgc 


gcgttctcgc 


60 


ggccatagca 


accgacgtac 


ggcgttgcgc 


cctcgccggc 


agcaagaagc 


cacggaagtc 


120 


cgcctggagc 


agaaaatgcc 


cacgctactg 


cgggtttata 


tagacggtcc 


tcacgggatg 


180 


gggaaaacca 


ccaccacgca 


actgctggtg 


gccctgggtt 


cgcgcgacga tatcgtctac 


240 


gtacccgagc 


cgatgactta 


ctggcaggtg 


ctgggggctt 


ccgagacaat 


cgcgaacatc 


300 


tacaccacac 


aacaccgcct 


cgaccagggt 


gagatatcgg 


ccggggacgc 


ggcggtggta 


360 


atgacaagcg 


cccagataac 


aatgggcatg 


ccttatgccg 


tgaccgacgc 


cgttctggct 


420 


cctcatatcg 


ggggggaggc 


tgggagctca 


catgccccgc 


ccccggccct 


caccctcatc 


480 


ttcgaccgcc 


atcccatcgc 


cgccctcctg 


tgctacccgg 


ccgcgcgata 


ccttatgggc 


540 


agcatgaccc 


cccaggccgt 


gctggcgttc 


gtggccctca 


tcccgccgac 


cttgccctgc 


600 


acaaacatcg 


tgttgggggc 


ccttccggag 


gacagacaca 


tcgaccgcct 


ggccaaacgc 


660 
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cagcgccccg 


gcgagcggct 


tgacctggct 


atgctggccg 


cgattcgccg 


cgtttacggg 


720 


ctgcttgcca atacggtgcg gtatctgcag ggcggcgggt 


cgtggcggga ggattgggga 


780 


cagctttcgg 


ggacggccgt 


gccgccccag 


aataccaaac 


cccaaaacaa 


cacaaaccca 

\a \m \a V* ^ X* V* 


840 


cgaccccata 


tcggggacac 


gttatttacc 


ctgtttcggg 


cccccgagtt 


gctggccccc 


900 


aacggcgacc 


tgtataacgt 


gtttgcctgg 


gccttggacg 


tcttggccaa 


acgcctccgt 


OCA 

ybl) 


cccatgcacg 


tctttatcct 


ggattacgac 


caatcgcccg 


ccggctgccg 


ggacgccctg 


1020 


ctgcaactta cctccgggat ggtccagacc 


cacgtcacca 


ccccaggctc 


cataccgacg 


1080 


atctgcgacc tggcgcgcac gtttgcccgg 


gagatggggg 


aggctaac 




1128 


<210> 214 
<211> 1128 
<212> DNA 

<213> artificial sequence 










<220> 

<223> HSVl-tk (herpes simplex virus type-1 
sequence-40 


thymidine kinase) 




<400> 214 
atggcttcgt 


acccctgcca 


tcaacacgcg 


tctgcgttcg 


accaggctgc 


gcgttctcgc 


60 


ggccatagca 


accgacgtac 


ggcgttgcgc 


cctcgccggc 


agcaagaagc 


cacggaagtc 


120 


cgcctggagc 


agaaaatgcc 


cacgctactg 


cgggtttata 


tagacggtcc 


tcacgggatg 


180 


gggaaaacca 


ccaccacgca 


actgctggtg 


gccctgggtt 


cgcgcgacga 


tatcgtctac 


240 


gtacccgagc 


cgatgactta 


ctggcaggtg 


ctgggggctt 


ccgagacaat 


cgcgaacatc 


300 


tacaccacac 


aacaccgcct 


cgaccagggt 


gagatatcgg 


ccggggacgc 


ggcggtggta 


360 


atgacaagcg 


cccagataac 


aatgggcatg 


ccttatgccg 


tgaccgacgc 


cgttctggct 


420 


cctcatatcg 


ggggggaggc 


tgggagctca 


catgccccgc 


ccccggccct 


caccctcatc 


480 


ttcgaccgcc 


atcccatcgc 


cgccctcctg 


tgctacccgg 


ccgcgcgata 


ccttatgggc 


540 


agcatgaccc 


cccaggccgt 


gctggcgttc 


gtggccctca 


tcccgccgac 


cttgcccggc 


600 


acaaacatcg 


tgttgggggc 


ccttccggag 


gacagacaca 


tcgaccgcct 


ggccaaacgc 


660 


cagcgccccg 


gcgagcggct 


tgacctggct 


atgctggccg 


cgattcgccg 


cgtttacggg 


720 


ctgcttgcca 


atatggtgcg 


gtatctgcag 


ggcggcgggt 


cgtggcggga 


ggattgggga 


780 


cagctttcgg 


ggacggccgt 


gccgccccag 


ggtgccgagc 


cccagagcaa 


cgcgggccca 


840 


cgaccccata 


tcggggacac 


gttatttacc 


ctgtttcggg 


cccccgagtt 


gctggccccc 


900 


aacggcgacc 


tgtataacgt 


gtttgcctgg 


gccttggacg 


tcttggccaa 


acgcctccgt 


960 


cccatgcacg 


tctttatcct 


ggattacgac 


caatcgcccg 


ccggctgccg 


ggacgccctg 


1020 


ctgcaactta 


cctccgggat 


ggtccagacc 


cacgtcacca 


ccccaggctc 


cataccgacg 


1080 
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atctgcgacc tggcgcgcac gtttgcccgg gagatggggg aggctaac 



1128 



<210> 215 

<211> 1128 

<212> DNA 

<213> arti f i ci al sequence 
<220> 

<223> HSVl-tk (herpes simplex virus type-1 thymidine kinase) 
sequence-41 



<400> 215 
atggcttcgt 


acccctgcca 


tcaacacgcg 


tctgcgttcg 


accaggctgc 


gcgttctcgc 


60 


ggccatagca 


accgacgtac 


ggcgttgcgc 


cctcgccggc 


agcaagaagc 


cacggaagtc 


120 


cgcctggagc 


agaaaatgcc 


cacgctactg 


cgggtttata 


tagacggtcc 


tcacgggatg 


180 


gggaaaacca 


ccaccacgca 


actgctggtg 


gccctgggtt 


cgcgcgacga 


tatcgtctac 


240 


gtacccgagc 


cgatgactta 


ctggcaggtg 


ctgggggctt 


ccgagacaat 


cgcgaacatc 


300 


tacaccacac 


aacaccgcct 


cgaccagggt 


gagatatcgg 


ccggggacgc 


ggcggtggta 


360 


atgacaagcg 


cccagataac 


aatgggcatg 


ccttatgccg 


tgaccgacgc 


cgttctggct 


420 


cctcatatcg 


ggggggaggc 


tgggagctca 


catgccccgc 


ccccggccct 


caccctcatc 


480 


ttcgaccgcc 


atcccatcgc 


cgccctcctg 


tgctacccgg 


ccgcgcgata 


ccttatgggc 


540 


agcatgaccc 


cccaggccgt 


gctggcgttc 


gtggccctca 


tcccgccgac 


cttgcccggc 


600 


acaaacatcg 


tgttgggggc 


ccttccggag 


gacagacaca 


tcgaccgcct 


ggccaaacgc 


660 


cagcgccccg 


gcgagcggct 


tgacctggct 


atgctggccg 


cgattcgccg 


cgtttacggg 


720 


ctgcttgcca 


atacggtgcg 


gtatctgcag 


ggcggcgggt 


cgtggcggga 


ggattgggga 


780 


cagctttcgg 


ggacggccgt 


gccgccccag 


ggtgccgagc 


cccagagcaa 


cgcgggccca 


840 


cgaccccata 


tcggggacat 


gttatttacc 


ctgtttcggg 


cccccgagtt 


gctggccccc 


900 


aacggcgacc 


tgtataacgt 


gtttgcctgg 


gccttggacg 


tcttggccaa 


acgcctccgt 


960 


cccatgcacg 


tctttatcct 


ggattacgac 


caatcgcccg 


ccggctgccg 


ggacgccctg 


1020 


ctgcaactta 


cctccgggat 


ggtccagacc 


cacgtcacca 


ccccaggctc 


cataccgacg 


1080 


atctgcgacc 


tggcgcgcac 


gtttgcccgg 


gagatggggg 


aggctaac 




1128 



<210> 216 

<211> 1128 

<212> DNA 

<213> artificial sequence 
<220> 

<223> HSVl-tk (herpes simplex virus type-1 thymidine kinase) 
sequence-42 

<400> 216 

atggcttcgt acccctgcca tcaacacgcg tctgcgttcg accaggctgc gcgttctcgc 60 
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ggccatagca 


accgacgtac 


ggcgttgcgc 


cctcgccggc 


agcaagaagc 


cacggaagtc 


120 


cgcctggagc 


agaaaatgcc 


cacgctactg 


cgggtttata 


tagacggtcc 


tcacgggatg 


180 


gggaaaacca 


ccaccacgca 


actgctggtg 


gccctgggtt 


cgcgcgacga 


tatcgtctac 


240 


gtacccgagc 


cgatgactta 


ctggcaggtg 


ctgggggctt 


ccgagacaat 


cgcgaacatc 


300 


tacaccacac 


aacaccgcct 


cgaccagggt 


gagatatcgg 


ccggggacgc 


ggcggtggta 


360 


atgacaagcg 


cccagataac 


aatgggcatg 


ccttatgccg 


tgaccgacgc 


cgttctggct 


420 


cctcatatcg 


ggggggaggc 


tgggagctca 


catgccccgc 


ccccggccct 


caccctcatc 


480 


ttcgaccgcc 


atcccatcgc 


cgccctcctg 


tgctacccgg 


ccgcgcgata 


ccttatgggc 


540 


agcatgaccc 


cccaggccgt 


gctggcgttc 


gtggccctca 


tcccgccgac 


cttgcccggc 


600 


acaaacatcg 


tgttgggggc 


ccttccggag 


gacagacaca 


tcgaccgcct 


ggccaaacgc 


660 


cagcgccccg 


gcgagcggct 


tgacctggct 


atgctggccg 


cgattcgccg 


cgtttacggg 


720 


ctgcttgcca 


atacggtgcg 


gtatctgcag 


ggcggcgggt 


cgtggcggga 


ggattgggga 


780 


cagctttcgg 


ggacggccgt 


gccgccccag 


ggtgccgagc 


cccagagcaa 


cgcgggccca 


840 


cgaccccata 


tcggggacac 


gttatttacc 


ctgtttcggg 


cccccgagtt 


gctggccccc 


900 


aacggcgacc 


tgtataacgt 


gtttgcctgg 


gccttggacg 


tcttggccaa 


acgcctccgt 


960 


cccatgcacg 


tctttatcct 


ggattacgac 


caatcgcccg 


ccggctaccg 


ggacgccctg 


1020 


ctgcaactta 


cctccgggat 


ggtccagacc 


cacgtcacca 


ccccaggctc 


cataccgacg 


1080 


atctgcgacc 


tggcgcgcac 


gtttgcccgg 


gagatggggg 


aggctaac 




1128 


<210> 217 
<211> 1128 
<212> DNA 

<213> artificial sequence 










<220> 

<223> HSVl-tk (herpes simplex virus type-1 
sequence-43 


thymidine kinase) 




<400> 217 
atggcttcgt 


acccctgcca 


tcaacacgcg 


tctgcgttcg 


accaggctgc gcgttctcgc 


60 


ggccatagca 


accgacgtac 


ggcgttgcgc 


cctcgccggc 


agcaagaagc 


cacggaagtc 


120 


cgcccggagc 


agaaaatgcc 


cacgctactg 


cgggtttata 


tagacggtcc 


ccacgggatg 


180 


gggaaaacca 


ccaccacgca 


actgctggtg 


gccctgggtt 


cgcgcgacga tatcgtctac 


24U 


gtacccgagc 


cgatgactta 


ctggcgggtg 


ctgggggctt 


ccgagacaat 


cgcgaacatc 


300 


tacaccacac 


aacaccgcct 


cgaccagggt 


gagatatcgg 


ccggggacgc 


ggcggtggta 


360 


atgacaagcg 


cccagataac 


aatgggcatg 


ccttatgccg 


tgaccgacgc 


cgttctggct 


420 


cctcatatcg 


ggggggaggc 


tgggagctca 


catgccccgc 


ccccggccct 


caccctcatc 


480 
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ttcgaccgcc 


atcccatcgc 


cgccctcctg 


tgctacccgg 


ccgcgcgata 


ccttatgggc 


540 


agcatgaccc 


cccaggccgt 


gctggcgttc 


gtggccctca 


tcccgccgac 


cttgcccggc 


600 


acaaacatcg 


tgttgggggc 


ccttccggag 


gacagacaca 


tcgaccgcct 


ggccaaacgc 


660 


cagcgccccg 


gcgagcggct 


tgacctggct 


atgctggccg 


cgattcgccg 


cgtttacggg 


720 


ctgcttgcca 


atacggtgcg 


gtatctgcag 


ggcggcgggt 


cgtggcggga 


ggattgggga 


780 


cagctttcgg 


ggacggccgt 


gccgccccag 


ggtgccgagc 


cccagagcaa 


cgcgggccca 


840 


cgaccccata 


tcggggacac 


gttatttacc 


ctgtttcggg 


cccccgagtt 


gctggccccc 


900 


aacggcgacc 


tgtataacgt 


gtttgcctgg 


gccttggacg 


tcttggccaa 


acgcctccgt 


960 


cccatgcacg 


tctttatcct 


ggattacgac 


caatcgcccg 


ccggctgccg 


ggacgccctg 


1020 


ctgcaactta 


cctccgggat 


ggtccagacc 


cacgtcacca 


ccccaggctc 


cataccgacg 


1080 


atctgcgacc 


tggcgcgcac 


gtttgcccgg 


gagatggggg 


aggctaac 




1128 


<210> 218 
<211> 1128 
<212> DNA 

<213> artificial sequence 










<220> 

<223> HSVl-tk (herpes simplex virus type-1 
sequence-44 


thymidine kinase) 




<400> 218 
atggcttcgt 


acccctgcca 


tcaacacgcg 


tctgcgttcg 


accaggctgc 


gcgttctcgc 


60 


ggccatagca 


accgacgtac 


ggcgttgcgc 


cctcgccggc 


agcaagaagc 


cacggaagtc 


120 


cgcctggagc 


agaaaatgcc 


cacgctactg 


cgggtttata 


tagacggtcc 


tcacgggatg 


180 


gggaaaacca 


ccaccacgca 


actgctggtg 


gccctgggtt 


cgcgcgacga 


tatcgtctac 


240 


gtacccgagc 


cgatgactta 


ctggcaggtg 


ctgggggctt 


ccgagacaat 


cgcgaacatc 


300 


tacaccacac 


aacaccgcct 


cgaccagggt 


gagatatcgg 


ccggggacgc 


ggcggtggta 


360 


atgacaagcg 


cccagataac 


aatgggcatg 


ccttatgccg 


tgaccgacgc 


cgttctggct 


420 


cctcatatcg 


ggggggaggc 


tgggagctca 


catgccccgc 


ccccggccct 


caccctcatc 


480 


ttcgaccgcc 


atcccatcgc 


cgccctcctg 


tgctacccgg 


ccgcgcgata 


ccttatgggc 


540 


agcatgaccc 


cccaggccgt 


gctggcgttc 


gtggccctca 


tcccgccgac 


cttgcccggc 


600 


acaaacatcg 


tgttgggggc 


ccttccggag 


gacagacaca 


tcgaccgcct 


ggccaaacgc 


660 


cagcgccccg 


gcgagcggct 


tgacctggct 


atgctggccg 


cgattcgccg 


cgtttacggg 


720 


ctgcttgcca 


atacggtgcg 


gtatctgcag 


ggcggcgggt 


cgtggcggga 


ggattgggga 


780 


cagctttcgg 


ggacggccgt 


gccgccccag 


ggtgccgagc 


cccagagcaa 


cgcgggccca 


840 


cgaccccata 


tcggggacac 


gttatttacc 


ctgtttcggg 
Page ; 


cccccgagtt 
206 


gctggccccc 


900 
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aacggcgacc tgtataacgt 


gtttgcctgg 


gccttggacg 


tcttggccaa acgcctccgt 


960 


cccatgcacg tctttatcct 


ggattacgac 


caatcgcccg 


ccggctgccg 


ggacgccctg 


1020 


ctgcaactta 


cctccgggat 


ggtccagacc 


cacgtcacca 


cccccggctc 


cataccgacg 


1080 


atctgcgacc tggcgcgcac gtttgcccgg ggagtggggg 


aggctaac 




1128 


<210> 219 
<211> 1128 
<212> DNA 

<213> artificial sequence 










<220> 

<223> HSVl-tk (herpes simplex virus type-1 
sequence-45 


thymidine kinase) 




<400> 219 
atggcttcgt 


acccctgcca 


tcaacacgcg 


tctgcgttcg 


accaggctgc 


gcgttctcgc 


60 


ggccatagca 


accgacgtac 


ggcgttgcgc 


cctcgccggc 


agcaagaagc 


cacggaagtc 


120 


cgcccggagc 


agaaaatgcc 


cacgctactg 


cgggtttata 


tagacggtcc 


ccacgggatg 


180 


gggaaaacca 


ccaccacgca 


actgctggtg 


gccctgggtt 


cgcgcgacga 


tatcgtctac 


240 


gtacccgagc 


cgatgactta 


ctggcgggtg 


ctgggggctt 


ccgagacaat 


cgcgaacatc 


300 


tacaccacac 


aacaccgcct 


cgaccagggt 


gagatatcgg 


ccggggacgc 


gqcgqtggta 


360 


atgacaagcg 


cccagataac 


aatgggcatg 


ccttatgccg 


tgaccgacgc 


cgttctggct 


420 


cctcatatcg 


ggggggaggc 


tgggagctca 


catgccccgc 


ccccggccct 


caccctcatc 


480 


ttcgaccgcc 


atcccatcgc 


cgccctcctg 


tgctacccgg 


ccgcgcggta 


ccttatgggc 


540 


agcatgaccc 


cccaggccgt 


gctggcgttc 


gtggccctca 


tcccgccgac 


cttgcccggc 


600 


acaaacatcg 


tgttgggggc 


ccttccggag 


gacagacaca 


tcgaccgcct 


ggccaaacgc 


660 


cagcgccccg 


gtgagcggct 


tgacctggct 


atgctggccg 


cgattcgccg 


cgtttacggg 


720 


ctacttgcca 


atacggtgcg 


gtatctgcag 


tgcggcgggt 


cgtggcggga 


ggattgggga 


780 


cagctttcgg 


ggacggcctt 


gacgccccag 


ggtgccgagc 


cccagagcaa 


cgcgggccca 


840 


cgaccccata 


tcggggaaac 


gttatttacc 


ctgtttcggg 


cccccgagtt 


gctggccccc 




aacggcgacc 


tgtacaacgt 


gtttgcctgg 


gccttggacg 


tcttggccaa 


acgcctccgt 


960 


cccatgcacg 


tctttatcct 


ggattacgac 


caatcgcccg 


ccggctgccg 


ggacgccctg 


1020 


ctgcaactta 


cctccgggat 


ggtccagacc 


catgtcacca 


ccccaggctc 


cataccgacg 


1080 


atctgcgacc 


tggcgcgcac 


gtttgcccgg 


gagatggggg 


aggctcac 




1128 



<210> 220 
<211> 1128 
<212> DNA 

<213> artificial sequence 
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<220> 

<223> HSVl-tk (herpes simplex virus type-1 thymidine kinase) 
sequence-46 



<400> 220 
atggcttcgt 


acccctgcca 


tcaacacgcg 


tetgegtteg 


accaggctgc 


gcgttctcgc 


60 


ggccatagca 


accgacgtac 


ggcgttgcgc 


cctcgccggc 


agcaagaagc 


caeggaagtc 


120 


cgcctggagc 


agaaaatgcc 


cacgctactg 


tgggtttata tagaeggtec 


teaegggatg 


180 


gggaaaacca 


ccaccacgca 


actgctggtg 


gccctgggtt 


cgcgcgacga 


tategtctae 


240 


gtacccgagc 


cgatgactta 


ctggcaggtg 


ctgggggctt 


ccgagacaat 


cgcgaacatc 


300 


tacaccacac 


aacaccgcct 


cgaccagggt 


gagatategg 


ccggggacgc 


ggcggtggta 


360 


atgacaagcg 


cccagataac 


aatgggcatg 


ccttatgccg 


tgaccgacgc 


cgttctggct 


420 


cctcatatcg 


ggggggaggc 


tgggagctca 


catgccccgc 


ccccggccct 


caccctcatc 


480 


ttcgaccgcc 


atcccatcgc 


cgccctcctg 


tgctacccgg 


ccgcgcgata 


ccttatgggc 


540 


agcatgaccc 


cccaggccgt 


gctggcgttc 


gtggccctca 


tcccgccgac 


cttgcccggc 


600 


acaaacatcg 


tgttgggggc 


ccttccggag 


gacagacaca 


tcgaccgcct 


ggccaaacgc 


660 


cagcgccccg 


gcgagcggct 


tgacctggct 


atgctggccg 


cgattcgccg 


cgtttacggg 


720 


LLgCLLyCCa 


•a 4- *a r~ r\ r\~\~ n r r\ 

d tdcgg uy eg 


g La lc tgcag 


ggcggcgggt 


cgtggcggga 


ggatxgggga 


70A 

/ oU 


cagctttcgg 


ggacggccgt 


gccgccccag 


ggtgccgagc 


cccagagcaa 


cgcgggccca 


840 


cgaccccata 


teggggacac 


gttatttacc 


ctgtttcggg 


cccccgagtt 


gctggccccc 


900 


aacggcgacc 


tgtataacgt 


gtttgcctgg 


gccttggacg 


tcttggccaa 


acgcctccgt 


960 


cccatgcacg 


tctttatcct 


ggattacgac 


caatcgcccg 


ccggctgccg 


ggacgccctg 


1020 


ctgcaactta 


cctccgggat 


ggtccagacc 


cacgtcacca 


ccccaggctc 


cataccgacg 


1080 


atctgcgacc 


tggcgcgcac 


gtttgecegg 


gagatggggg 


aggctaac 




1128 



<210> 221 

<211> 1128 

<212> DNA 

<213> arti f i ci al sequence 
<220> 

<223> HSVl-tk (herpes simplex virus type-1 thymidine kinase) 
sequence-47 

<400> 221 

atggcttcgt accccggcca tcaacacgcg tetgegtteg accaggctgc gcgttctcgc 60 

ggccatagca accgacgtac ggcgttgcgc cctcgccggc agcaagaagc caeggaagtc 120 

cgcctggagc agaaaatgcc cacgctactg egggtttata tagaeggtec teaegggatg 180 

gggaaaacca ccaccacgca actgctggtg gccctgggtt cgcgcgacga tategtctae 240 
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gtacccgagc 


cgatgactta 


ctggcgggtg 


ctgggggctt 


ccgagacaat 


cgcgaacatc 


300 


tacaccacac 


aacaccgcct 


cgaccagggt 


gagatatcgg 


ccggggacgc 


ggcggtggta 


360 


atgacaagcg 


cccagataac 


aatgggcatg 


ccttatgccg 


tgaccgacgc 


cgttctggct 


420 


cctcatatcg 


ggggggaggc 


tgggagctca 


catgccccgc 


ccccggccct 


caccctcatc 


480 


ttcgaccgcc 


atcccatcgc 


cgccctcctg 


tgctacccgg 


ccgcgcggta 


ccttatgggc 


540 


agcatgaccc 


cccaggccgt 


gctggcgttc 


gtggccctca tcccgccgac 


cttgcccggc 


600 


acaaacatcg 


tgttgggggc 


ccttccggag 


gacagacaca tcgaccgcct 


ggccaaacgc 


660 


cagcgccccg 


gtgagcggct 


tgacctggct 


atgctggccg 


cgattcgccg 


cgtttacggg 


720 


ctacttgcca 


atacggtgcg 


gtatctgcag 


tgcggcgggt 


cgtggcggga 


ggattgggga 


780 


cagctttcgg 


ggacggcctt 


gacgccccag 


ggtgccgagc 


cccagagcaa 


cgcgggccca 


840 


cgaccccata 


tcggggaaac 


gttatttacc 


ctgtttcggg 


cccccgagtt 


gctggccccc 


900 


aacggcgacc 


tgtacaacgt 


gtttgcctgg 


gccttggacg 


tcttggccaa 


acgcctccgt 


960 


cccatgcacg 


tctttatcct 


ggattacgac 


caatcgcccg 


ccggctgccg 


ggacgccctg 


1020 


ctgcaactta 


cctccgggat 


ggtccagacc 


catgtcacca 


ccccaggctc 


cataccgacg 


1080 


atctgcgacc 


tggcgcgcac 


gtttgcccgg 


gagatggggg aggctcac 




1128 



<210> 222 

<211> 1128 

<212> DNA 

<213> artificial sequence 
<220> 

<223> HSVl-tk (herpes simplex virus type-1 thymidine kinase) 
sequence-48 



<400> 222 
atggcttcgt 


acccctgcca 


tcaacacgcg 


tctgcgttcg 


accaggctgc 


gcgttctcgc 


60 


ggccatagca 


accgacgtac 


ggcgttgcgc 


cctcgccggc 


aacaagaagc 


cacggaagtc 


120 


cgcctggagc 


agaaaatgcc 


cacgctactg 


cgggtttata 


tagacggtcc 


tcacgggatg 


180 


gggaaaacca 


ccaccacgca 


actgctggtg 


gccctgggtt 


cgcgcgacga 


tatcgtctac 


240 


gtacccgagc 


cgatgactta 


ctggcaggtg 


ctgggggctt 


ccgagacaat 


cgcgaacatc 


300 


tacaccacac 


aacaccgcct 


cgaccagggt 


gagatatcgg 


ccggggacgc 


ggcggtggta 


360 


atgacaagcg 


cccagataac 


aatgggcatg 


ccttatgccg 


tgaccgacgc 


cgttctggct 


420 


cctcatatcg 


ggggggaggc 


tgggagctca 


catgccccgc 


ccccggccct 


caccctcatc 


480 


ttcgaccgcc 


atcccatcgc 


cgccctcctg 


tgttacccgg 


ccgcgcgata 


ccttatgggc 


540 


agcatgaccc 


cccaggccgt 


gctggcgttc gtggccctca tcccgccgac 


cttgcccggc 


600 


acaaacatcg 


tgttgggggc 


ccttccggag 


gacagacaca 


tcgaccgcct 


ggccaaacgc 


660 
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cagcgccccg gcgagcggct tgacctggct atgctggccg cgattcgccg cgtttacgag 720 

ctgcttgcca atacggtgcg gtatctgcag ggcggcgggt cgtggcggga ggattgggga 780 

cagctttcgg ggacggccgt gccgccccag ggtgccgagc cccagagcaa cgcgggccca 840 

cgaccccata tcggggacac gttatttacc ctgtttcggg cccccgagtt gctggccccc 900 

aacggcgacc tgtataacgt gtttgcctgg gccttggacg tcttggccaa acgcctccgt 960 

cccatgcacg tctttatcct ggattacgac caatcgcccg ccggctgccg ggacgccctg 1020 

ctgcaactta cctccgggat ggtccagacc cacgtcacca cccccggctc cataccgacg 1080 

atctgcgacc tggcgcgcac gtttgcccgg gagatggggg aggctaac 1128 

<210> 223 

<211> 1128 

<212> DNA 

<213> artificial sequence 
<220> 

<223> HSVl-tk (herpes simplex virus type-1 thymidine kinase) 
sequence-49 



<400> 223 
atggcttcgt 


acccctgcca 


tcaacacgcg 


tctgcgttcg 


accaggctgc 


gcgttctcgc 


60 


ggccatagca 


accgacgtac 


ggcgttgcgc 


cctcgccggc 


aacaagaagc 


cacggaagtc 


120 


cgcctggagc 


agaaaatgcc 


cacgctactg 


cgggtttata 


tagacggtcc 


tcacgggatg 


180 


cjggaaaacca 


ccaccacgca 


actgctggtg 


gccctgggtt 


cgcgcgacga 


tatcgtctac 


240 


gtacccgagc 


cgatgactta 


ctggcaggtg 


ctgggggctt 


ccgagacaat 


cgcgaacatc 


300 


tacaccacac 


aacaccgcct 


cgaccagggt 


gagatatcgg 


ccggggacgc 


ggcggtggta 


360 


atgacaagcg 


cccagataac 


aatgggcatg 


ccttatgccg 


tgaccgacgc 


cgttctggct 


420 


cctcatatcg 


ggggggaggc 


tgggagctca 


catgccccgc 


ccccggccct 


caccctcatc 


480 


ttcgaccgcc 


atcccatcgc 


cgccctcctg 


tgttacccgg 


ccgtgcgata 


ccttatgggc 


540 


agcatgaccc 


cccaggccgt 


gctggcgttc 


gtggccctca tcccgccgac 


cttgcccggc 


600 


acaaacatcg 


tgttgggggc 


ccttccggag 


gacagacaca 


tcgaccgcct 


ggccaaacgc 


660 


cagcgccccg 


gcgagcggct 


tgacctggct 


atgctggccg 


cgattcgccg 


cgtttacgag 


720 


ctgcttgcca 


atacggtgcg 


gtatctgcag 


ggcggcgggt 


cgtggcggga 


ggattgggga 


780 


cagctttcgg 


ggacggccgt 


gccgccccag 


ggtgccgagc 


cccagagcaa 


cgcgggccca 


840 


cgaccccata 


tcggggacac 


gttatttacc 


ctgtttcggg 


cccccgagtt 


gctggccccc 


900 


aacggcgacc 


tgtataacgt 


gtttgcctgg 


gccttggacg 


tcttggccaa 


acgcctccgt 


960 


cccatgcacg 


tctttatcct 


ggattacgac 


caatcgcccg 


ccggctgccg 


ggacgccctg 


1020 


ctgcaactta 


cctccgggat 


ggtccagacc 


cacgtcacca 


cccccggctc 


cataccgacg 


1080 
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atctgcgacc cggcgcgcac gtttgcccgg gagatggggg aggctaac 1128 



<210> 224 

<211> 1128 

<212> DNA 

<213> artificial sequence 
<220> 

<223> HSVl-tk (herpes simplex virus type-1 thymidine kinase) 
sequence-50 



<400> 224 
atggcttcgt 


acccctgcca 


tcaacacgcg 


tctgcgttcg 


accaggctgc 


gcgttctcgc 


60 


ggccatagca 


accgacgtac 


ggcgttgcgc 


cctcgccggc 


aacaagaagc 


cacggaagtc 


120 


cgcctggagc 


agaaaatgcc 


cacgctactg 


cgggtttata 


tagacggtcc 


tcacgggatg 


180 


gggaaaacca 


ccaccacgca 


actgctggtg 


gccctgggtt 


cgcgcgacga 


tatcgtctac 


240 


gtacccgagc 


cgataactta 


ctggcaggtg 


ctgggggctt 


ccgagacaat 


cgcgaacatc 


300 


tacaccacac 


aacaccgcct 


cgaccagggt 


gagatatcgg 


ccggggacgc 


ggcggtggta 


360 


atgacaagcg 


cccagataac 


aatgggcatg 


ccttatgccg 


tgaccgacgc 


cgttctggct 


420 


cctcatatcg 


ggggggaggc 


tgggagctca 


catgccccgc 


ccccggccct 


caccctcatc 


480 


ttcgaccggc 


atcccatcgc 


cgccctcctg 


tgttacccgg 


ccgcgcgata 


ccttatgggc 


540 


agcatgaccc 


cccaggccgt 


gctggcgttc 


gtggccctca 


tcccgccgac 


cttgcccggc 


600 


acaaacatcg 


tgttgggggc 


ccttccggag 


gacagacaca 


tcgaccgcct 


ggccaaacgc 


660 


cagcgccccg 


gcgagcggct 


tgacctggct 


atgctggccg 


cgattcgccg 


cgtttacgag 


720 


ctgcttgcca 


atacggtgcg 


gtatctgcag 


ggcggcgggt 


cgtggcggga 


ggattgggga 


780 


cagctttcgg 


ggacggccgt 


gccgccccag 


ggtgccgagc 


cccagagcaa 


cgcgggccca 


840 


cgaccccata 


tcggggacac 


gttatttacc 


ctgtttcggg 


cccccgagtt 


gctggccccc 


900 


aacggcgacc 


tgtataacgt 


gtttgcctgg 


gccttggacg 


tcttggccaa 


acgcctccgt 


960 


cccatgcacg 


tctttatcct 


ggattacgac 


caatcgcccg 


ccggctgccg 


ggacgccctg 


1020 


ctgcaactta 


cctccgggat 


ggtccagacc 


cacgtcacca 


cccccggctc 


cataccgacg 


1080 


atctgcgacc 


tggcgcgcac 


gtttgcccgg 


gagatggggg 


aggctaac 




1128 



<210> 225 

<211> 1128 

<212> DNA 

<213> artificial sequence 
<220> 

<223> HSVl-tk (herpes simplex virus type-1 thymidine kinase) 
sequence-51 

<400> 225 

atggcttcgt accccggcca tcaacacgcg tctgcgttcg accaggctgc gcgttctcgc 60 
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ggccatagca 


accgacgtac 


ggcgttgcgc 


cctcgccggc 


agcaagaagc 


cacggaagtc 


120 


cgcctggagc 


agaaaatgcc 


cacgctactg 


cgggtttata 


tagacggtcc 


tcacgggatg 


180 


gggaaaacca 


ccaccacgca 


actgctggtg 


gccctgggtt 


cgcgcgacga 


tatcgtctac 


240 


gtacccgagc 


cgatgactta 


ctggcgggtg 


ctgggggctt 


ccgagacaat 


cgcgaacatc 


300 


tacaccacac 


aacaccgcct 


cgaccagggt 


gagatatcgg 


ccggggacgc ggcggtggta 


360 


atgacaagcg 


cccagataac 


aatgggcatg 


ccttatgccg 


tgaccgacgc 


cgttctggct 


420 


cctcatatcg 


ggggggaggc 


tgggagctca 


catgccccgc 


ccccggccct 


caccctcatc 


480 


ttcgaccgcc 


atcccatcgc 


cgccctcctg 


tgctacccgg 


ccgcgcgata 


ccttatgggc 


540 


agcatgaccc 


cccaggccgt 


gctggcgttc 


gtggtcctca 


tcccgccgac 


cttgcccggc 


600 


acaaacatcg 


tgttgggggc 


ccttccggag 


gacagacaca tcgaccgcct ggccaaacgc 


660 


cagcgccccg 


gtgagcggct 


tgacctggct 


atgctggccg 


cgattcgccg 


cgtttacggg 


720 


ctacttgcca 


atacggtgcg 


gtatctgcag 


tgcggcgggt 


cgtggcggga ggattgggga 


780 


cagctttcgg 


ggacggcctt 


gacgccccag 


ggtgccgagc 


cccagagcaa 


cgcgggccca 


840 


rciArrrrata 

>- y ci V- v. v. v_ a. ia 




y 1. UC* L l> L.C4.V.V. 


ctgtttcggg 


cccccgagtt 


gctggccccc 


900 


aacggcgacc 


tgtacaacgt 


gtttgcctgg 


gccttggacg tcttggccaa acgcctccgt 


960 


cccatgcacg 


tctttatcct 


ggattacgac 


caatcgcccg 


ccggctgccg 


ggacgccctg 


1020 


ctgcaactta 


cctccgggat 


ggtccagacc 


catgtcacca 


ccccaggctc 


cataccgacg 


1080 


atctgcgacc 


tggcgcgcac 


gtttgcccgg 


gagatggggg 


aggctaac 




1128 



<210> 226 

<211> 1128 

<212> DNA 

<213> artificial sequence 
<220> 

<223> HSVl-tk (herpes simplex virus type-1 thymidine kinase) 
sequence-52 

<400> 226 

atggcttcgt accccggcca tcaacacgcg tctgcgttcg accaggctgc gcgttctcgc 60 

ggccatagca accgacgtac ggcgttgcgc cctcgccggc agcaagaagc cacggaagtc 120 

cgcctggagc agaaaatgcc cacgctactg cgggtttata tagacggtcc tcacgggatg 180 

gggaaaacca ccaccacgca actgctggtg gccctgggtt cgcgcgacga tatcgtctac 240 

gtacccaagc cgatgactta ctggcgggtg ctgggggctt ccgagacaat cgcgaacatc 300 

tacaccacac aacaccgcct cgaccagggt gagatatcgg ccggggacgc ggcggtggta 360 

atgacaagcg cccagataac aatgggcatg ccttatgccg tgaccgacgc cgttctggct 420 

cctcatatcg ggggggaggc tgggagctca catgccccgc ccccggccct caccctcatc 480 
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ttcgaccgcc 


atcccatcgc 


cgccctcctg 


tgctacccgg 


ccgcgcgata 


ccttatgggc 


540 


agcatgaccc 


cccaggccgt 


gctggcgttc 


gtggtcctca 


tcccgccgac 


cttgcccggc 


600 


acaaacatcg 


tgttgggggc 


ccttccggag 


gacagacaca 


tcgaccgcct ggccaaacgc 


660 


cagcgccccg 


gtgagcggct 


tgacctggct 


atgctggccg 


cgattcgccg 


cgtttacggg 


720 


ctacttgcca 


atacggtgcg 


gtatctgcag 


tgcggcgggt 


cgtggcggga 


ggattgggga 


780 


cagctttcgg 


ggacggcctt 


gacgccccag 


ggtgccgagc 


cccagagcaa 


cgcgggccca 


840 


cgaccccata 


tcggggaaac 


gttatttacc 


ctgtttcggg 


cccccgagtt 


gctggccccc 


900 


aacggcgacc 


tgtacaacgt 


gtttgcctgg 


gccttggacg 


tcttggccaa 


acgcctccgt 


960 


cccatgcacg 


tctttatcct 


ggattacgac 


caatcgcccg 


ccggctgccg 


ggacgccctg 


1020 


ctgcaactta 


cctccgggat 


ggtccagacc 


catgtcacca 


ccccaggctc 


cataccgacg 


1080 


atctgcgacc 


tggcgcgcac 


gtttgcccgg 


gagatggggg 


aggctaac 




1128 


<210> 227 
<211> 1128 
<212> DNA 

<213> artificial sequence 










<220> 

<223> HSVl-tk (herpes simplex virus type-1 
sequence-53 


thymidine kinase) 




<400> 227 
atggcttcgt 


accccggcca 


tcaacacgcg 


tctgcgttcg 


accaggctgc 


gcgttctcgc 


60 


ggccatagca 


accgacgtac 


ggcgttgcgc 


cctcgccggc 


agcaagaagc 


cacggaagtc 


120 


cgcctggagc 


agaaaatgcc 


cacgctactg 


cgggtttata 


tagacggtcc 


tcacgggatg 


180 


gggaaaacca 


ccaccacgca 


actgctggtg 


gccctgggtt 


cgcgcgacga 


tatcgtctac 


240 


gtacccaagc 


cgatgactta 


ctggcgggtg 


ctgggggctt 


ccgagacaat 


cgcgaacatc 


300 


tacaccacac 


aacaccgcct 


cgaccagggt 


gagatatcgg 


ccggggacgc 


ggcggtggta 


360 


atgacaagcg 


cccagataac 


aatgggcatg 


ccttatgccg 


tgaccgacgc 


cgttctggct 


420 


cctcatatcg 


ggggggaggc 


tgggagctca 


catgccccgc 


ccccggccct 


caccctcatc 


480 


ttcgaccgcc 


atcccatcgc 


cgccctcctg 


tgctacccgg 


ccgcgcgata 


ccttatgggc 


540 


agcatgaccc 


cccaggccgt 


gctggcgttc 


gtggtcctca 


tcccgccgac 


cttgcccggc 


600 


acaaacatcg 


tgttgggggc 


ccttccggag 


gacagacaca 


tcgaccgcct 


ggccaaacgc 


660 


cagcgccccg 


gtgagcggct 


tgacctggct 


atgctggccg 


cgattcgccg 


cgtttacggg 


720 


ctacttgcca 


atacggtgcg 


gtatctgcag 


tgcggcgggt 


cgtggcggga 


ggattgggga 


780 


cagctttcgg 


ggacggcctt 


gacgccccag 


ggtgccgagc 


cccagagcaa 


cgcgggccca 


840 


cgaccccata 


tcggggaaac 


gttatttacc 


ctgtttcggg cccccgagtt 
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aacggcgacc tgtacaacgt gtttgcctgg gccttggacg tcttggccaa acgcctccgt 960 

cccatgcacg tctttatcct ggattacgac caatcgcccg ccggctgccg ggacgccctg 1020 

ctgcaactta cctccgggat ggtccagacc catgtcacca ccccaggctc cataccgacg 1080 

atctgcgacc tggcgcgcac gtttgcccgg gagatggggg aggctaac 1128 

<210> 228 

<211> 1128 

<212> DNA 

<213> artificial sequence 
<220> 

<223> HSVl-tk (herpes simplex virus type-1 thymidine kinase) 
sequence-54 



<400> 228 
atggcttcgt 


acccctgcca 


tcaacacgcg 


tctgcgttcg 


accaggctgt 


gcgttctcgc 


60 


ggccatagca 


accgacgtac 


ggcgttgcgc 


cctcgccggc 


aacaagaagc 


cacggaagtc 


120 


cgcccggagc 


agaaaatgcc 


cacgctactg 


cgggtttata 


tagacggtcc 


tcacgggatg 


180 


gggaaaacca 


ccaccacgca 


actgctggtg 

ZJ ZJ ZJ ZJ 


gccctgggtt 

ZJ 7 7 7 


cgcgcgacga 


tatcgtctac 


240 


gtacccgagc 


cgatgactta 


ctqqcqqqtq 

33 333 ZJ 


ctgggggctt 


ccgagacaat 


cgcgaacatc 


300 


tacaccacac 


aaccccgcct 


cgaccagggt 


gagatatcgg 


ccggggacgc 


ggcggtggta 


360 


atgacaagcg 


cccagataac 


aatgggcatg 


ccttatgccg 


tgaccgacgc 


cgttctggct 


420 


cctcatatcg 


ggggggaggc 


tgggagctca 


catgccccgc 


ccccggccct 


caccctcatc 


480 


ttcgaccgcc 


atcccatcgc 


cgccctcctg 


tgttacccgg 


ccgcgcgata 


ccttatgggc 


540 


agcatgaccc 


cccaggccgt 


gctggcgttc 


gtggccctca 


tcccgccgac 


cttgcccggc 


600 


acaaacatcg 


tgcttggggc 


ccttccggag 


gacagacaca 


tcgaccgcct 


ggccaaacgc 


660 


cagcgccccg 


gcgagcggct 


tgacctggct 


atgctggctg 


cgattcgccg 


cgtttacgag 


720 


ctacttgcca 


atacggtgcg 


gtatctgcag 


ggcggcgggt 


cgtggcggga 


ggactgggga 


780 


cagctttcgg 


ggacggcctt 


gacgccccag 


ggtgccgagc 


cccagagcaa 


cgcgggccca 


840 


cgaccccata 


tcggggaaac 


gttatttacc 


ctgtttcggg 


cccccgagtt 


gctggccccc 


900 


aacggcgacc 


tgtacaacgt 


gtttgcctgg 


gccttggacg 


tcttggccaa 


acgcctccgt 


960 


cccatgcacg 


tctttatcct 


ggattacgac 


caatcgcccg 


ccggctgccg 


ggacgccctg 


1020 


ctgcaactta 


cctccgggat 


ggtccagacc 


catgtcacca 


ccccaggctc 


cataccgacg 


1080 


atctgcgacc 


tggcgcgcac 


gtttgcccgg 


gagatggggg 


cggctaac 




1128 



<210> 229 
<211> 1128 
<212> DNA 

<213> artificial sequence 
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<220> 

<223> HSVl-tk (herpes simplex virus type-1 thymidine kinase) 
sequence-55 



<400> 229 
atggcttcgt 


acccctgcca 


tcaacacgcg 


tctgcgttcg 


accaggctgc 


gcgttctcgc 


60 


ggccatagca 


accgacgtac 


ggcgttgcgc 


cctcgccggc 


aacaagaagc 


cacggaagtc 


120 


cgcctggagc 


agaaaatgcc 


cacgctactg 


cgggtttata 


tagacggtcc 


tcacgggatg 


180 


gggaaaacca 


ccaccacgca 


actgctggtg 


gccctgggtt 


cgcgcgacga tatcgtctac 


240 


gtacccgagc 


cgatgactta 


ctggcgggtg 


ctgggggctt 


ccgagacaat 


cgcgaacatc 


300 


tacaccacac 


aacaccgcct 


cgaccagggt 


gagatatcgg 


ccggggacgc 


ggcggtggta 


360 


atgacaagcg 


cccagataac 


aatgggcatg 


ccttatgccg 


tgaccgacgc 


cgttctggct 


420 


cctcatatcg 


ggggggaggc 


tgggagctca 


catgccccgc 


ccccggccct 


caccctcatc 


480 


ttcgaccgcc 


atcccatcgc 


cgccctcctg 


tgttacccgg 


ccgcgcgata 


ccttatgggc 


540 


agcatgaccc 


cccaggccgt 


gctggcgttc 


gtggccctca 


tcccgccgac 


cttgcccggc 


600 


acaaacatcg 


tgcttggggc 


ccttccggag 


gacagacaca 


tcgaccgcct 


ggccaaacgc 


660 


cagcgccccg 


gcgagcggct 


tgacctggct 


atgctggctg 


cgattcgccg 


cgtttacggg 


720 


LLdLL lyLLd 


atarnntnrn 

dLdLyytycy 


y LdtL ty Lay 


xgcggcgggx 


cgtggcggga ggactgggga 


f Ov 


cagctttcgg 


ggacggccgt 


gacgccccag 


ggtgccgagc 


cccagagcaa 


cgcgggccca 


840 


cgaccccata 


tcggggaaac 


gttatttacc 


ctgtttcggg 


cccccgagtt 


gctggccccc 


900 


aacggcgacc 


tgtacaacgt 


gtttgcctgg 


gccttggacg 


tcttggccaa acgcctccgt 


960 


cccatgcacg 


tctttatcct 


ggattacgac 


caatcgcccg 


ccggctgccg 


ggacgccctg 


1020 


ctgcaactta 


cctccgggat 


ggtccagacc 


catgtcacca 


ccccaggctc 


cataccgacg 


1080 


atctgcgacc 


tggcgcgcac 


gtttgcccgg 


gagatggggg 


aggctcac 




1128 



<210> 230 

<211> 1128 

<212> DNA 

<213> artificial sequence 
<220> 

<223> HSVl-tk (herpes simplex virus type-1 thymidine kinase) 
sequence-56 

<400> 230 

atggcttcgt accccggcca ttagcacgcg tctgcgttcg accaggctgc gcgttctcgc 60 

ggccatagca accgacgtac ggcgttgcgc cctcgccggc agcaagaagc cacggaagtc 120 

cgcccggagc agaaaatgcc cacgctactg cgggtttata tagacggtcc ccacgggatg 180 

gggaaaacca ccaccacgca actgctggtg gccctgggtt cgcgcgacga tatcgtctac 240 

Page 215 



sequence listing_ST25.txt 



gtacccgagc 


cgatgactta 


ctggcgggtg 


ctgggggctt 


ccgagacaat 


cgcgaacatc 


300 


tacaccacac 


aacaccgcct 


cgaccagggt 


gagatatcgg 


ccggggacgc 


ggcggtggta 


360 


atgacaagcg 


cccagataac 


aatgggcatg 


ccttatgccg 


tgaccgacgc 


cgttctggct 


420 


cctcatatcg 


ggggggaggc 


tgggagctca 


catgccccgc 


ccccggccct 


caccctcatc 


480 


ttcgaccgcc 


atcccatcgc 


cgccctcctg 


tgctacccgg 


ccgcgcgata 


ccttatgggc 


540 


agcatgaccc 


cccaggccgt 


gctggcgttc 


gtggccctca 


tcccgccgac 


cttgcccggc 


600 


acaaacatcg 


tgttgggggc 


ccttccggag 


gacagacaca 


tcgaccgcct 


ggccaaacgc 


660 


cagcgccccg 


gcgagcggct 


tgacctggct 


atgctggccg 


cgattcgccg 


cgtttacggg 


720 


ctgcttgcca 


atacggtgcg 


gtatctgcag 


ggcggcgggt 


cgtggcggga ggattgggga 


780 


cagctttcgg 


ggacggccgt 


gccgccccag 


ggtgccgagc 


cccagagcaa 


cgcgggccca 


840 


cgaccccaca 


tcggggacac 


gttatttacc 


ctgtttcggg 


cccccgagtt 


gctggccccc 


900 


aacggcgacc 


tgtataacgt 


gtttgcctgg 


gccttggacg 


tcttggccaa 


acgcctccgt 


960 


cccatgcacg 


tctttatcct 


ggattacgac 


caatcgcccg 


ccggctgccg 


ggacgccctg 


1020 


ctgcaactta 


cctccgggat 


ggtccagacc 


cacgtcacca 


ccccaggctc 


cataccgacg 


1080 


atctgcgacc 


tggcgcgcac 


gtttgcccgg 


gagatggggg 


aggctaac 




1128 



<210> 231 

<211> 1128 

<212> DNA 

<213> artificial sequence 
<220> 

<223> HSVl-tk (herpes simplex virus type-1 thymidine kinase) 
sequence-57 



<400> 231 
atggcttcgt 


accccggcca 


tcaacacgcg 


tctgcgttcg 


accaggctgc 


gcgttctcgc 


60 


ggccatagca 


accgacgtac 


ggcgttgcgc 


cctcgccggc 


agcaagaagc 


cacggaagtc 


120 


cacctggagc 


agaaaatgcc 


cacgctactg 


cgggtttata 


tagacggtcc 


tcacgggatg 


180 


gggaaaacca 


ccaccacgca 


actgctggtg 


gccctgggtt 


cgcgcgacga 


tatcgtctac 


240 


gtacccgagc 


cgatgactta 


ctggcgggtg 


ctgggggctt 


ccgagacaat 


cgcgaacatc 


300 


tacaccacac 


aacaccgcct 


cgaccagggt 


gagatatcgg 


ccggggacgc 


ggcggtggta 


360 


atgacaagcg 


cccagataac 


aatgggcatg 


ccttatgccg 


tgaccgacgc 


cgttctggct 


420 


cctcatatcg 


ggggggaggc 


tgggagctca 


catgccccgc 


ccccggccct 


caccctcatc 


480 


ttcgaccgcc 


atcccatcgc 


cgccctcctg 


tgctacccgg 


ccgcgcggta 


ccttatgggc 


540 


agcatgaccc 


cccaggccgt 


gctggcgttc 


gtggtcctca 


tcccgccgac 


cttgcccggc 


600 


acaaacatcg 


tgttgggggc 


ccttccggag 


gacagacaca 


tcgaccgcct 


ggccaaacgc 


660 
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cagcgccccg gtgagcggct 


tgacctggct 


atgctggccg 


cgattcgccg 


cgtttacggg 


720 


ctacttgcca atacggtgcg gtatctgcag 


tgcggcgggt 


cgtggcggga 


ggattgggga 


780 


cagctttcgg 


ggacggcctt 


gacgccccag 


ggtgccgagc 


cccagagcaa 


cgcgggccca 


840 


cgaccccata 


tcggggaaac 


gttatttacc 


ctgtttcggg 


cccccgagtt 


gctggccccc 


900 


aacggcgacc 


tgtacaacgt 


gtttgcctgg 


gccttggacg 


tcttggccaa 


acgcctccgt 


you 


cccatgcacg 


tctttatcct 


ggattacgac 


caatcgcccg 


ccggctgccg 


ggacgccctg 


1020 


ctgcaactta cctccgggat ggtccagacc 


cacgtcacca 


cccccggctc 


cataccgacg 


1080 


atctgcgacc tggcgcgcac gtttgcccgg gagatggggg 


aggctaac 




1128 


<210> 232 
<211> 1128 
<212> DNA 

<213> arti f i ci al sequence 










<220> 

<223> HSVl-tk (herpes simplex virus type-1 
sequence-58 


thymidine kinase) 




<400> 232 
atggcttcgt 


accccggcca 


tcaacacgcg 


tctgcgttcg 


accaggctgc 


gcgttctcgc 


60 


ggccatagca 


accgacgtac 


ggcgttgcgc 


cctcgccggc 


agcaagaagc 


cacggaagtc 


120 


cacctggagc 


agaaaatgcc 


cacgctactg 


cgggtttata 


tagacggtcc 


tcacgggatg 


180 


gggaaaacca 


ccaccacgca 


actgctggtg 


gccctgggtt 


cgcgcgacga 


tatcgtctac 


240 


gtacccgagc 


cgatgactta 


ctggcgggtg 


ctgggggctt 


ccgagacaat 


cgcgaacatc 


300 


tacaccacac 


aacaccgcct 


cgaccagggt 


gagatatcgg 


ccggggacgc 


ggcggtggta 


360 


atgacaagcg 


cccagataac 


aatgggcatg 


ccttatgccg 


tgaccgacgc 


cgttctggct 


420 


cctcatatcg 


ggggggaggc 


tgggagctca 


catgccccgc 


ccccggccct 


caccctcatc 


480 


ttcgaccacc 


atcccatcgc 


cgccctcctg 


tgctacccgg 


ccgcgcggta 


ccttatgggc 


540 


agcatgaccc 


cccaggccgt 


gctggcgttc 


gtggtcctca 


tcccgccgac 


cttgcccggc 


600 


acaaacatcg 


tgttgggggc 


ccttccggag 


gacagacaca 


tcgaccgcct 


ggccaaacgc 


660 


cagcgccccg 


gtgagcggct 


tgacctggct 


atgctggccg 


cgattcgccg 


cgtttacggg 


720 


ctacttgcca 


atacggtgcg 


gtatctgcag 


tgcggcgggt 


cgtggcggga 


ggattgggga 


780 


cagctttcgg 


ggacggcctt 


gacgccccag 


ggtgccgagc 


cccagagcaa 


cgcgggccca 


840 


cgaccccata 


tcggggaaac 


gttatttacc 


ctgtttcggg 


cccccgagtt 


gctggccccc 


900 


aacggcgacc 


tgtacaacgt 


gtttgcctgg 


gccttggacg 


tcttggccaa 


acgcctccgt 


960 


cccatgcacg 


tctttatcct 


ggattacgac 


caatcgcccg 


ccggctgccg 


ggacgccctg 


1020 


ctgcaactta 


cctccgggat 


ggtccagacc 


cacgtcacca 


cccccggctc 


cataccgacg 


1080 
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atctgcgacc tggcgcgcac gtttgcccgg gagatggggg aggctaac 1128 



<210> 233 

<211> 1128 

<212> DNA 

<213> artificial sequence 



<220> 

<223> HSVl-tk (herpes simplex virus type-1 thymidine kinase) 
sequence-59 



<400> 233 
atggcttcgt 


accccggcca 


tcagcacgcg 


tctgcgttcg 


accaggctgc 


gcgttctcgc 


60 


ggccatagca 


accgacgtac 


ggcgttgcgc 


cctcgccggc 


agcaagaagc 


cacggaagtc 


120 


cgcccggagc 


agaaaatgcc 


cacgctactg 


cgggtttata 


tagacggtcc 


ccacgggatg 


180 


gggaaaacca 


ccaccacgca 


actgctggtg 


gccctgggtt 


cgcgcgacga 


tatcgtctac 


240 


gtacccgagc 


cgatgactta 


ctggcgggtg 


ctgggggctt 


ccgagacaat 


cgcgaacatc 


300 


tacaccacac 


aacaccgcct 


cgaccagggt 


gagatatcgg 


ccggggacgc 


ggcggtggta 


360 


atgacaagcg 


cccagataac 


aatgggcatg 


ccttatgccg 


tgaccgacgc 


cgttctggct 


420 


cctcatatcg 


ggggggaggc 


tgggagctca 


catgccccgc 


ccccggccct 


caccctcatc 


480 


ttcgaccgcc 


atcccatcgc 


cgccctcctg 


tgctacccgg 


ccgcgcgata 


ccttatgggc 


540 


agcatgaccc 


cccaggccgt 


gctggcgttc 


gtggccctca 


tcccgccgac cttgcccggc 


600 


acaaacatcg 


tgttgggggc 


ccttccggag 


gacagacaca 


tcgaccgcct 


ggccaaacgc 


660 


cagcgccccg 


gcgagcggct 


tgacctggct 


atgctggccg 


cgattcgccg 


cgtttacggg 


720 


ctgcttgcca 


atacggtgcg 


gtatctgcag 


ggcggcgggt 


cgtggcggga 


ggattgggga 


780 


cagctttcgg 


ggacggccgt 


gccgccccag 


ggtgccgagc 


cccagagcaa 


cgcgggccca 


840 


cgaccccata 


tcggggacac 


gttatttacc 


ctgtttcggg 


cccccgagtt 


gctggccccc 


900 


aacggcgacc 


tgtacaacgt 


gtttgcctgg 


gccttggacg 


tcttggccaa acgcctccgt 


960 


cccatgcacg 


tctttatcct 


ggattacgac 


caatcgcccg 


ccggctgccg 


ggacgccctg 


1020 


ctgcaactta 


cctccgggat 


gatccagacc 


cacgtcacca 


ccccaggctc 


cataccgacg 


1080 


atctgcgacc 


tggcgcgcac 


gtttgcccgg 


gagatggggg 


aggctaac 




1128 



<210> 234 

<211> 1128 

<212> DNA 

<213> artificial sequence 
<220> 

<223> HSVl-tk (herpes simplex virus type-1 thymidine kinase) 
sequence-60 

<400> 234 

atggcttcgt acccctgcca tcaacacgcg tctgcgttcg accaggctgc gcgttctcgc 60 
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ggccatagca 


accgacgtac 


ggcgttgcgc 


cctcgccggc 


aacaagaagc 


cacggaagtc 


120 


cgcctggagc 


agaaaatgcc 


cacgctactg 


cgggtttata 


tagacggtcc 


tcacgggatg 


180 


gggaaaacca 


ccaccacgca 


actgctggtg 


gccctgggtt 


cgcgcgacga 


tatcgtctac 


240 


gtacccgagc 


cgatgactta 


ctggcaggtg 


ctgggggctt 


ccgagacaat 


cgcgaacatc 


300 


nacaccacac 


aacaccgcct 


cgaccagggt 


gagatatcgg 


ccggggacgc 


ggcggtggta 


360 


atgacaagcg 


cccagataac 


aatgggcatg 


ccttatgccg 


tgaccgacgc 


cgttctggct 


420 


cctcatatcg 


ggggggaggc 


tgggagctca 


catgccccgc 


ccccggccct 


caccctcatc 


480 


ttcgaccgcc 


atcccatcgc 


cgccctcctg 


tgttacccgg 


ccgcgcgata 


ccttatgggc 


540 


agcatgaccc 


cccaggccgt 


gctggcgttc 


gtggccctca 


tcccgccgac 


cttgcccggc 


600 


acaaacatcg 


tgttgggggc 


ccttccggag 


gacagacaca 


ccgaccgcct 


ggccaaacgc 


660 


cagcgccccg 


gcgagcggct 


tgacctggct 


atgctggccg 


cgattcgccg 


cgtttacgag 


720 


ctgcttgcca 


atacggtgcg 


gtatctgcag 


ggcggcgggt 


cgtggcggga 


ggattgggga 


780 


cagctttcgg 


ggacggccgt 


gccgccccag 


ggtgccgagc 


cccagagcaa 


cgcgggccca 


840 


cgaccccata 


tcggggacac 


gttatttacc 


ctgtttcggg 


cccccgagtt 


gctggccccc 


900 


aacggcgacc 


tgtataacgt 


gtttgcctgg 


gccttggacg 


tcttggccaa 


acgcctccgt 


960 


cccatgcacg 


tctttatcct 


ggattacgac 


caatcgcccg 


ccggctgccg 


ggacgccctg 


1020 


ctgcaactta 


cctccgggat 


ggtccagacc 


cacgtcacca 


cccccggctc 


cataccgacg 


1080 


atctgcgacc 


tggcgcgcac 


gtttgcccgg 


gagatggggg 


aggctaac 




1128 


<210> 235 
<211> 1128 
<212> DNA 

<213> arti f i ci al sequence 










<220> 

<223> HSVl-tk (herpes simplex virus type-1 
sequence-61 


thymidine kinase) 




<400> 235 

atggcttcgt acccctgcca tcaacacgcg tctgcgttcg 


accaggctgc 


gcgttctcgc 


60 


ggccatagca accgacgtac ggcgttgcgc 


cctcgccggc 


aacaagaagc 


cacggaagtc 


120 


cgcctggagc agaaaatgcc 


cacgctactg 


cgggtttata 


tagacggtcc 


tcacgggatg 


180 


gggaaaacca 


ccaccacgca actgctggtg 


gccctgggtt 


cgcgcgacga 


tatcgtctac 




gtacccgagc 


cgatgactta 


ctggcaggtg 


ctgggggctt 


ccgagacaat 


cgcgaacatc 


300 


tacaccacac 


aacaccgcct 


cgaccagggt 


gagatatcgg 


ccggggacgc 


ggcggtggta 


360 


atgacaagcg 


cccagataac 


aatgggcatg 


ccttatgccg 


tgaccgacgc 


cgttctggct 


420 


cctcatatcg 


ggggggaggc 


tgggagctca 


catgccccgc 


ccccggccct 


caccctcatc 


480 
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ttcgaccgcc 


atcccatcgc 


cgccctcctg 


tgttacccgg 


ccgcgcgata ccttatgggc 


540 


agcatgaccc 


cccaggccgt 


gctggtgttc 


gtggccctca tcccgccgac 


cttgcccggc 


600 


acaaacatcg 


tgttgggggc 


ccttccggag 


gacagacaca 


ccgaccgcct 


ggccaaacgc 


660 


cagcgccccg 


gcgagcggct 


tgacctggct 


atgctggccg 


cgattcgccg 


cgtttacgag 


720 


ctgcttgcca 


atacggtgcg 


gtatctgcag 


ggcggcgggt 

ZJ ZJ ZJ ZJ ZJ ZJ ZJ 


cgtggcggga ggattgggga 


780 


cagctttcgg 


ggacggccgt 


gccgccccag 


ggtgccgagc 


cccagagcaa 


cgcgggccca 


840 


LyaLLLLa La 




n1" 1" 7\ r r 


ctgtttcggg 


cccccgagtt 


gctggccccc 




aacggcgacc 


tgtataacgt 


gtttgcctgg 


gccttggacg 


tcttggccaa 


acgcctccgt 


960 


cccatgcacg 


tctttatcct 


ggattacgac 


caatcgcccg 


ccggctgccg 


ggacgccctg 


1020 


ctgcaactta 


cctccgggat 


ggtccagacc 


cacgtcacca 


cccccggctc 


cataccgacg 


1080 


atctgcgacc 


tggcgcgcac 


gtttgcccgg 


gagatggggg 


aggctaac 




1128 



<210> 236 

<211> 1128 

<212> DNA 

<213> artificial sequence 
<220> 

<:223> HSVl-tk (herpes simplex virus type-1 thymidine kinase) 
sequence-62 



<400> 236 
atggcttcgt 


accccggcca 


tcagcacgcg 


tctgcgttcg 


accaggctgc 


gcgttctcgc 


60 


ggccatagca 


accgacgtac 


ggcgttgcgc 


cctcgccggc 


agcaagaagc 


cacggaagtc 


120 


cacccggagc 


agaaaatgcc 


cacgctactg 


cgggtttata 


tagacggtcc 


ccacgggatg 


180 


gggaaaacca 


ccaccacgca 


actgctggtg 


gccctgggtt 


cgcgcgacga 


tatcgtctac 


240 


gtacccgagc 


cgatgactta 


ctggcgggtg 


ctgggggctt 


ccgagacaat 


cgcgaacatc 


300 


tacaccacac 


aacaccgcct 


cgaccagggt 


gagatatcgg 


ccggggacgc 


ggcggtggta 


360 


atgacaagcg 


cccagataac 


aatgggcatg 


ccttatgccg 


tgaccgacgc 


cgttctggct 


420 


cctcatatcg 


ggggggaggc 


tgggagctca 


catgccccgc 


ccccggccct 


caccctcatc 


480 


ttcgaccgcc 


atcccatcgc 


cgccctcctg 


tgctacccgg 


ccgcgcgata 


ccttatgggc 


540 


agcatgaccc 


cccaggccgt 


gctggcgttc 


gtggccctca tcccgccgac 


cttgcccggc 


600 


acaaacatcg 


tgttgggggc 


ccttccggag 


gacagacaca tcgaccgcct 


ggccaaacgc 


660 


cagcgccccg 


gcgagcggct 


tgacctggct 


atgctggccg 


cgattcgccg 


cgtttacggg 


720 


ctgcttgcca 


atacggtgcg 


gtatctgcag 


ggcggcgggt 


cgtggcggga 


ggattgggga 


780 


cagctttcgg 


ggacggccgt 


gccgccccag 


ggtgccgagc 


cccagagcaa 


cgcgggccca 


840 


cgaccccata 


tcggggacac 


gttatttacc 


ctgtttcggg 


cccccgagtt 


gctggccccc 


900 
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aacggcgacc tgtacaacgt gtttgcctgg gccttggacg tcttggccaa acgcctccgt 960 

cccatgcacg tctttatcct ggattacgac caatcgcccg ccggctgccg ggacgccctg 1020 

ctgcaactta cctccgggat gatccagacc cacgtcacca ccccaggctc cataccgacg 1080 

atctgcgacc tggcgcgcac gtttgcccgg gagatggggg aggctcac 1128 

<210> 237 

<211> 1128 

<212> DNA 

<213> artificial sequence 
<220> 

<223> HSVl-tk (herpes simplex virus type-1 thymidine kinase) 
sequence-63 



<400> 237 
atggcttcgt 


accccggcca 


tcaacacgcg 


tctgcgttcg 


accaggctgc 


gcgttctcgc 


60 


ggccatagca 


accgacgtac 


ggcgttgcgc 

ZJ ZJ ZJ ZJ ZJ 


cctcgccggc 


agcaagaagc 


cacggaagtc 


120 


cacctggagc 


agaaaatgcc 


cacgctactg 


cgggtttata 


tagacggtcc 


tcacgggatg 


180 


gggaaaacca 


ccaccacgca 


actqctggtg 

ZJ ZJ ZJ ZJ 


gccctgggtt 


cgcgcgacga 


tatcgtctac 


240 


gtacccgagc 


cgatgactta 


ctggcgggtg 


ctgggggctt 


ccgagacaat 


cgcgaacatc 


300 


tacaccacac 


aacaccgcct 


cgaccagggt 


gagatatcgg 


ccggggacgc 


ggcggtggta 




atgacaagcg 


cccagataac 


aatgggcatg 


ccttatgccg 


tgaccgacgc 


cgttctggct 


420 


cctcatatcg 


ggggggaggc 


tgggagctca 


catgccccgc 


ccccggccct 


caccctcatc 


480 


ttcgaccgcc 


atcccatcgc 


cgccctcctg 


tgctacccgg 


ccgcgcggta 


ccttatgggc 


540 


agcatgaccc 


cccaggccgt 


gctggcgttc 


gtggtcctca tcccgccgac 


cttgcccggc 


600 


acaaacatcg 


tgttgggggc 


ccttccggag 


gacagacaca tcgaccgcct 


ggccaaacgc 


660 


cagcgccccg 


gtgagcggct 


tgacctggct 


atgctggccg 


cgattcgccg 


cgtttacggg 


720 


ctacttgcca 


atacggtgcg 


gtatctgcag 


tgcggcgggt 


cgtggcggga 


ggattgggga 


780 


cagctttcgg 


ggacggcctt 


gacgccccag 


ggtgccgagc 


cccagagcaa 


cgcgggccca 


840 


cgaccccata 


tcggggaaac 


gttatttacc 


ctgtttcggg 


cccccgagtt 


gttggccccc 


900 


aacggcgacc 


tgtacaacgt 


gtttgcctgg 


gccttggacg tcttggccaa 


acgcctccgt 


960 


cccatgcacg 


tctttatcct 


ggattacgac 


caatcgcccg 


ccggctgccg 


ggacgccctg 


1020 


ctgcaactta 


cctccgggat 


ggtccagacc 


catgtcacca 


ccccaggctc 


cataccgacg 


1080 


atctgcgacc 


tggcgcgcac 


gtttgcccgg 


gagatggggg 


aggctcac 




1128 



<210> 238 
<2!11> 1128 
<2!12> DNA 

<213> artificial sequence 
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<220> 

<223> HSVl-tk (herpes simplex virus type-1 
sequence-64 


thymidine kinase) 




^ a r\r\^ TOO 

atggcttcgt 


accccggcca 


tcaacacgcg 


tctgcgttcg 


accaggctgc gcgttctcgc 


60 


ggccatagca 


accgacgtac 


ggcgttgcgc 


cctcgccggc 


agcaagaagc cacggaagtc 


120 


cacctggagc 


agaaaatgcc 


cacgctactg 


cgggtttata 


tagacggtcc tcacgggatg 


180 


gggaaaacca 


ccaccacgca 


actgctggtg 


gccctgggtt 


cgcgcgacga tatcgtctac 


240 


gtacccgagc 


cgatgactta 


ctggcgggtg 


ctgggggctt 


ccgagacaat cgcgaacatc 


300 


tacaccacac 


aacaccgcct 


cgaccagggt 


gagatatcgg 


ccggggacgc ggcggtggta 


360 


atgacaagcg 


cccagataac 


aatgggcatg 


ccttatgccg 


tgaccgacgc cgttctggct 


420 


cctcatatcg 


ggggggaggc 


tgggagctca 


catgccccgc 


ccccggccct caccctcatc 


480 


ttcgaccgcc 


atcccatcgc 


cgccctcctg 


tgctacccgg 


ccgcgcggta ccttatgggc 


540 


agcatgaccc 


cccaggccgt 


gctggcgttc 


gtggtcctca 


tcccgccgac cttgcccggc 


600 


acaaacatcg 


tgttgggggc 


ccttccggag 


gacagacaca 


tcgaccgcct ggccaaacgc 


660 


cagcgccccg 


gcgagcggct 


tgacctggct 


atgctggccg 


cgattcgccg ccgttacggg 


720 


ctacttgcca 


atacggtgcg 


gtatctgcag 


tgcggcgggt 


cgtggcggga ggattgggga 


780 


cagctttcgg 


ggacggccgt 


gccgccccag 


ggtgccgagc 


cccagagcaa cgcgggccca 


840 


cgaccccata 


tcggggacac 


gttatttacc 


ctgtttcggg 


LLLLLyayLL Lyyn_*_t_t_ 




aacggcgacc 


tgtacaacgt 


gtttgcctgg 


gccttggacg 


tcttggccaa acgcctccgt 


960 


cccatgcacg 


tctttatcct 


ggattacgac 


caatcgcccg 


ccggctgccg ggacgccctg 


1020 


ctgcaactta 


cctccgggat 


gatccagacc 


cacgtcacca 


ccccaggctc cataccgacg 


1080 


atctgcgacc 


tggcgcgcac 


gtttgcccgg 


gagatggggg 


aggctaac 


1128 



<210> 239 

<211> 1128 

<212> DNA 

<213> artificial sequence 
<220> 

<223> HSVl-tk (herpes simplex virus type-1 thymidine kinase) 
sequence-65 

<400> 239 

atggcttcgt accccggcca tcaacacgcg tctgcgttcg accaggctgc gcgttctcgc 60 

ggccatagca accgacgtac ggcgttgcgc cctcgccggc agcaagaagc cacggaagtc 120 

cacctggagc agaaaatgcc cacgctactg cgggtttata tagacggtcc tcacgggatg 180 

gggaaaacca ccaccacgca actgctggtg gccctgggtt cgcgcgacga tatcgtctac 240 
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cgatgactta 
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ctggcgggtg ctgggggctt ccgagacaat 


cgcgaacatc 


300 


tacaccacac 


aacaccgcct 


cgaccagggt gagatatcgg 


ccggggacgc ggcggtggta 


360 


atgacaagcg 


cccagataac 


aatgggcatg 


ccttatgccg 


tgaccgacgc 


cgttctggct 


420 


cctcatatcg 


ggggggaggc 


tgggagctca 


catgccccgc 


ccccggccct 


caccctcatc 


480 


ttcgaccgcc 


atcccatcgc 


cgccctcctg 


tgctacccgg 


ccgcgcggta ccttatgggc 


540 


agcatgaccc 


cccaggccgt 


gctggcgttc 


gtggtcctca 


tcccgccgac 


cttgcccggc 


600 


acaaacatcg 


tgttgggggc 


ccttccggag 


gacagacaca 


tcgaccgcct 


ggccaaatgc 


660 


cagcgccccg 


gtgagcggct 


tgacctggct 


atgctggccg 


cgattcgccg 


cgtttacggg 


720 


ctacttgcca 


atacggtgcg 


gtatctgcag 


tgcggcgggt 


cgtggcggga ggattgggga 


780 


cagctttcgg 


ggacggcctt 


gacgccccag 


aataccaaac 


cccagagcaa 


cgcgggccca 


840 


cgaccccata 


tcggggaaac 


gttatttacc 


ctgtttcggg 


cccccgagtt 


gctggccccc 


900 


aacggcgacc 


tgtacaacgt 


gtttgcctgg 


gccttggacg 


tcttggccaa 


acgcctccgt 


960 


cccatgcacg 


tctttatcct 


ggattacgac 


caatcgcccg 


ccggctgccg 


ggacgccctg 


1020 


ctgcaactta 


cctccgggat 


ggtccagacc 


catgtcacca 


ccccaggctc 


cataccgacg 


1080 


atctgcgacc 


tggcgcgcac 


gtttgcccgg 


gagatggggg 


aggctcac 




1128 


<210> 240 
<211> 1128 
<212> DNA 

<213> artificial sequence 










<220> 

<223> HSVl-tk (herpes simplex virus type-1 
sequence-66 


thymidine kinase) 




<400> 240 
atggcttcgt 


acccctgcca 


tcaacacgcg 


tctgcgttcg 


accaggctgc gcgttctcgc 


60 


ggccatagca 


accgacgtac 


ggcgttgcgc 


cctcgccggc 


agcaagaagc 


cacggaagtc 


120 


cgcctggagc 


agaaaatgcc 


cacgctactg 


cgggtttata 


tagacggtcc 


tcacgggatg 


180 


gggaaaacca 


ccaccacgca 


actgctggtg 


gccctgggtt 


cgcgcgacga tatcgtctac 


240 


gtacccgagc 


cgatgactta 


ctggcaggtg 


ctgggggctt 


ccgagacaat 


cgcgaacatc 


300 


tacaccacac 


aacaccgcct 


cgaccagggt 


gagatatcgg 


ccggggacgc 


ggcggtggta 


360 


atgacaagcg 


cccagataac 


aatgggcatg 


ccttatgccg 


tgaccgacgc 


cgttctggct 


420 


cctcatatcg 


ggggggaggc 


tgggagctca 


catgccccgc 


ccccggccct 


caccctcatc 


Attn 


ttcgaccgcc 


atcccatcgc 


cgccctcctg 


tgctacccgg 


ccgcgcgata 


ccttatgggc 


540 


agcatgaccc 


cccaggccgt 


gctggcgttc 


gtggccctca 


tcccgccgac 


cttgcccggc 


600 


acaaacatcg 


tgttgggggc 


ccttccggag 


gacagacaca 


tcgaccgcct 


ggccaaacgc 


660 
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cagcgccccg gcgagcggct tgacctggct atgctggccg cgattcgccg cgtttacggg 720 

ctgcttgcca atacggtgcg gtatctgcag ggcggcgggt cgtggcggga ggattgggga 780 

cagctttcgg ggacggccgt gccgccccag ggtgccgagc cccagagcaa cgcgggccca 840 

cgaccccata tcggggacac gttatttacc ctgtttcggg cccccgagtt gctggccccc 900 

aacggcgacc tgtataacgt gtttgcctgg gccttggacg tcttggccaa acgcctccgt 960 

cccatgcacg tctttatcct ggattacgac caatcgcccg ccggctgccg ggacgccctg 1020 

ctgcaactta cctccgggat ggtccagacc cacgtcacca ccccaggctc cataccgacg 1080 

atctgcgacc tggtgcgcac gtttgcccgg gagatggggg aggctaac 1128 

<210> 241 

<211> 1128 

<212> DNA 

<213> artificial sequence 
<220> 

<223> HSVl-tk (herpes simplex virus type-1 thymidine kinase) 
sequence-67 



<400> 241 
atggcttcgt 


accccggcca 


tcaacacgcg 


tctgcgttcg 


accaggctgc 


gcgttctcgc 


60 


ggccatagca 


accgacgtac 


ggcgttgcgc 


cctcgccggc 


agcaagaagc 


cacggaagtc 


120 


cacctggagc 


agaaaatgcc 


cacgctactg 


cgggtttata 


tagacggtcc 


tcacgggatg 


180 


gggaaaacca 


ccaccacgca 


actgctggtg 


gccctgggtt 


cgcgcgacga 


tatcgtctac 


240 


gtacccgagc 


cgatgactta 


ctggcgggtg 


ctgggggctt 


ccgagacaat 


cgcgaacatc 


300 


tacaccacac 


aacaccgcct 


cgaccagggt 


gagatatcgg 


ccggggacgc 


ggcggtggta 


360 


atgacaagcg 


cccagataac 


aatgggcatg 


ccttatgccg 


tgaccgacgc 


cgttctggct 


420 


cctcatatcg 


ggggggaggc 


tgggagctca 


catgccccgc 


ccccggccct 


caccctcatc 


480 


ttcgaccgcc 


atccaatcgc 


cgccctcctg 


tgctacccgg 


ccgcgcggta 


ccttatgggc 


540 


agcatgaccc 


cccaggccgt 


gctggcgttc 


gtggtcctca 


tcccgccgac 


cttgcccggc 


600 


acaaacatcg 


tgttgggggc 


ccttccggag 


gacagacaca 


tcgaccgcct 


ggccaaacgc 


660 


cagcgccccg 


gtgagcggct 


tgacctggct 


atgctggccg 


cgattcgccg 


cgtttacggg 


720 


ctacttgcca 


atacggtgcg 


gtatctgcag 


tgcggcgggt 


cgtggcggga 


ggattgggga 


780 


cagctttcgg 


ggacggcctt 


gacgccccag 


ggtgccgagc 


cccagagcaa 


cgcgggccca 


840 


cgaccccata 


tcggggaaac 


gttatttacc 


ctgtttcggg 


cccccgagtt 


gctggccccc 


900 


aacggcgacc 


tgtacaacgt 


gtttgcctgg 


gccttggacg 


tcttggccaa 


acgcctccgt 


960 


cccatgcacg 


tctttatcct 


ggattacgac 


caatcgcccg 


ccggctgccg 


ggacgccctg 


1020 


ctgcaactta 


cctccgggat 


ggtccagacc 


catgtcacca 


ccccaggctc 


cataccgacg 


1080 
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atctgcgacc tggcgcgcac gtttgcccgg gagatggggg aggctcac 1128 



<210> 242 

<211> 1128 

<212> DNA 

<213> artificial sequence 



<220> 

<223> HSVl-tk (herpes simplex virus type-1 thymidine kinase) 
sequence-68 



<400> 242 
atggcttcgt 


accccggcca 


tcaacacgcg 


tctgcgttcg 


accaggctgc 


gcgttctcgc 


60 


ggccatagca 


accgacgtac 


ggcgttgcgc 


cctcgccggc 


agcaagaagc 


cacggaagtc 


120 


cacctggagc 


agaaaatgcc 


cacgctactg 


cgggtttata 


tagacggtcc 


tcacgggatg 


180 


gggaaaacca 


ccaccacgca 


actgctggtg 


gccctgggtt 


cgcgcgacga 


tatcgtctac 


240 


gtacccgagc 


cgatgactta 


ctggcgggtg 


ctgggggctt 


ccgagacaat 


cgcgaacatc 


300 


tacaccacac 


aacaccgcct 


cgaccagggt 


gagatatcgg 


ccggggacgc 


ggcggtggta 


360 


atgacaagcg 


cccagataac 


aatgggcatg 


ccttatgccg 


tgaccgacgc 


cgttctggct 


420 


cctcatatcg 


ggggggaggc 


tgggagctca 


catgccccgc 


ccccggccct 


caccctcatc 


480 


ttcgaccgcc 


atccaatcgc 


cgccctcctg 


tgctacccgg 


ccgcgcggta 


ccttatgggc 


540 


agcatgaccc 


cccaggccgt 


gctggcgttc 


gtggtcctca 


tcccgccgac 


cttgcccggc 


600 


acaaacatcg 


tgttgggggc 


ccttccggag 


gacagacaca 


tcgaccacct 


ggccaaacgc 


660 


cagcgccccg 


gtgagcggct 


tgacctggct 


atgctggccg 


cgattcgccg 


cgtttacggg 


720 


ctacttgcca 


atacggtgcg 


gtatctgcag 


tgcggcgggt 


cgtggcggga 


ggattgggga 


780 


cagctttcgg 


ggacggcctt 


gacgccccag 


ggtgccgagc 


cccagagcaa 


cgcgggccca 


840 


cgaccccata 


tcggggaaac 


gttatttacc 


ctgtttcggg 


cccccgagtt 


gctggccccc 


900 


aacggcgacc 


tgtacaacgt 


gtttgcctgg 


gccttggacg 


tcttggccaa 


acgcctccgt 


960 


cccatgcacg 


tctttatcct 


ggattacgac 


caatcgcccg 


ccggctgccg 


ggacgccctg 


1020 


ctgcaactta 


cctccgggat 


ggtccagacc 


catgtcacca 


ccccaggctc 


cataccgacg 


1080 


atctgcgacc 


tggcgcgcac 


gtttgcccgg 


gagatggggg 


aggctcac 




1128 



<210> 243 

<211> 1128 

<212> DNA 

<213> arti f i ci al sequence 
<220> 

<223> HSVl-tk (herpes simplex virus type-1 thymidine kinase) 
sequence-69 

<400> 243 

atggcttcgt acccctgcca tcaacacgcg tctgcgttcg accaggctgc gcgttctcgc 60 
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ggccatagca 

• J ZJ ZJ 


accgacgtac 


ggcgttgcgc 


cctcgccggc 


agcaagaagc 


cacggaagtc 


120 


cgcctggagc 


agaaaatgcc 


cacgctactg 


cgggtttata 


tagacggtcc 


tcacgggatg 


180 


gggaaaacca 


ccaccacgca 


actgctggtg 


gccctgggtt 


cgcgcgacga 


tatcgtctac 


240 


gtacccgagc 


cgatgactta 


ctggcaggtg 

ZJ ZJ ZJ ZJ ZJ 


ctgggggctt 


ccgagacaat 


cgcgaacatc 


300 


tacaccacac 


aacaccgcct 


cgaccagggt 


gagatatcgg 


ccggggacgc 

ZJ ZJ ZJ ZJ ZJ 


ggcggtggta 

ZJ ZJ ZJ ZJ ZJ ZJ 


360 


atgacaagcg 


cccagataac 


aatggacatg 


ccttatgccg 


tgaccgacgc 


cgttctggct 


420 


cctcatatcg 


ggggggaggc 


tgggagctca 


catgccccgc 


ccccggccct 


caccctcatc 


480 


ttcgaccgcc 


atcccatcgc 


cgccctcctg 


tgctacccgg 


ccgcgcgata 


ccttatgggc 


540 


agcatgaccc 


cccaggccgt 


gctggcgttc 


gtggccctca 


tcccgccgac 


cttgcccggc 


600 


acaaacatcg 


tgttgggggc 


ccttccggag 


gacagacaca 


tcgaccgcct 


ggccaaacgc 


660 


cagcgccccg 


gcgagcggct 


tgacctggct 


atgctggccg 


cgattcgccg 


cgtttacggg 

ZJ ZJ ZJ ZJ 


720 


ctgcttgcca 


atacggtgcg 


gtatctgcag 


ggcggcgggt 


cgtggcggga 

ZJ ZJ ZJ ZJ ZJ ZJ 


ggattgggga 

ZJ ZJ ZJ ZJ ZJ ZJ 


780 


cagctttcgg 


ggacggccgt 


gccgccccag 


ggtgccgagc 


cccagagcaa 

ZJ ZJ 


cgcgggccca 

ZJ ZJ ZJ ZJ 


840 


cgaccccata 


tcggggacac 


gttatttacc 


ctgtttcggg 


cccccgagtt 


gctggccccc 


900 


aacggcgacc 


tgtataacgt 


gtttgcctgg 


gccttggacg 


tcttggccaa 


acgcctccgt 


960 


cccatgcacg 


tctttatcct 


ggattacgac 


caatcgcccg 


ccggctgccg 


ggacgccctg 


1020 


ctgcaactta 


cctccgggat 


ggtccagacc 


cacatcacca 


ccccaaactc 


cataccaaca 


1080 


atctgcgacc 


tggcgcgcac 


gtttgcccgg 


gagatggggg 


aggctaac 




1128 


<210> 244 
<211> 1128 
<212> DNA 

<213> artificial sequence 










<220> 

<223> HSVl-tk (herpes simplex virus type-1 
sequence-70 


thymidine kinase) 




<400> 244 

atggcttcgt accccggcca tcaacacgcg 


tctgcgttcg 


accaggctgc 


gcgttctcgc 


60 


ggccatagca accgacgtac 


ggcgttgcgc 


cctcgccggc 


agcaagaagc 


cacggaagtc 


120 


ccjcccggagc 


agaaaatgcc 


cacgctactg 


cgggtttata 


tagacggtcc 


ccacgggatg 


180 


gggaaaacca 


ccaccacgca 


actgctggtg 


gccctgggtt 


cgcgcgacga 


tatcgtctac 


240 


gtacccgagc 


cgatgactta 


ctggcgggtg 


ctgggggctt 


ccgagacaat 


cgcgaacatc 


300 


tacaccacac 


aacaccgcct 


cgaccagggt 


gagatatcgg 


ccggggacgc 


ggcggtggta 


360 


atgacaagcg 


cccagataac 


aatgggcatg 


ccttatgccg 


tgaccgacgc 


cgttctggct 


420 


cctcatatcg 


ggggggaggc 


tgggagctca 


catgccccgc 


ccccggccct 


caccctcatc 


480 
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ttcgaccgcc 


atcccatcgc 


cgccctcctg 


tgctacccgg 


ccgcgcggta 


ccttatgggc 


540 


agcatgaccc 


cccaggccgt 


gctggcgttc 

Zf ZJ ZJ ZJ 


gtggtcctca 


tcccgccgac 


cttgcccggc 


600 


acaaacatcg 


tgttgggggc 

Zt ZJ Zf Zt Z* ZJ 


ccttccggag 


gacagacaca 


tcgaccgcct 


ggccaaacgc 


660 


cagcgccccg 


gtgagcggct 


tgacctggct 


atgctggccg 


cgattcgccg 


cgtttacggg 


720 


ctacttgcca 


atacggtgcg 


gtatctgcag 


tgcggcgggt 


cgtggcggga 


ggattgggga 


780 


cagctttcgg 


ggacggcctt 


gacgccccag 


ggtgccgagc 


cccagagcaa 


cgcgggccca 


840 


cgaccccata 


tcggggaaac 


gttatttacc 


ctgtttcggg 


cccccgagtt 


gctggccccc 


900 


aacggcgacc 


tgtacaacgt 


gtttgcctgg 


gccttggacg 


tcttggccaa 


acgcctccgt 


960 


cccatgcacg 


tctttatcct 


ggattacgac 


caatcgcccg 


ccggctgccg ggacgccctg 


1020 


ctgcaactta 


cctccgggat 


ggtccagacc 


catgtcacca 


ccccaggctc 


cataccgacg 


1080 


atctgcgacc 


tggcgcgcac 


gtttgcccgg 


gagatggggg 


aggctcac 




1128 


<210> 245 
<211> 1128 
<212> DNA 

<213> artificial sequence 










<220> 

<223> HSVl-tk (herpes simplex virus type-1 
sequence-71 


thymidine kinase) 




<400> 245 
atggcttcgt 


acccctgcca 


tcaacacgcg 


tctgcgttcg 


accaggctgc 


gcgttctcgc 


60 


ggccatagca 


accgacgtac 


ggcgttgcgc 


cctcgccggc 


agcaagaagc 


cacggaagtc 


120 


cgcctggagc 


agaaaatgcc 


cacgctactg 


cgggtttata 


tagacggtcc 


tcgcgggatg 


180 


gggaaaacca 


ccaccacgca 


actgctggtg 


gccctgggtt 


cgcgcgacga 


tatcgtctac 


240 


gtacccgagc 


cgatgactta 


ctggcaggtg 


ctgggggctt 


ccgagacaat 


cgcgaacatc 


300 


tacaccacac 


aacaccgcct 


cgaccagggt 


gagatatcgg 


ccggggacgc 


ggcggtggta 


360 


atgacaagcg 


cccagataac 


aatgggcatg 


ccttatgccg 


tgaccgacgc 


cgttctggct 


420 


cctcatatcg 


ggggggaggc 


tgggagctca 


catgccccgc 


ccccggccct 


caccctcatc 


480 


ttcgaccgcc 


atcccatcgc 


cgccctcctg 


tgctacccgg 


ccgcgcgata 


ccttatgggc 


540 


agcatgaccc 


cccaggccgt 


gctggcgttc 


gtggccctca 


tcccgccgac 


cttgcccggc 


600 


acaaacatcg 


tgttgggggc 


ccttccggag 


gacagacaca 


tcgaccgcct 


ggccaaacgc 


660 


cagcgccccg 


gcgagcggct 


tgacctggct 


atgctggccg 


cgattcgccg 


cgtttacggg 


720 


ctgcttgcca 


atacggtgcg 


gtatctgcag 


ggcggcgggt 


cgtggcggga 


ggattgggga 


780 


cagctttcgg 


ggacggccgt 


gccgccccag 


ggtgccgagc 


cccagagcaa 


cgcgggccca 


840 


cgaccccata 


tcggggacac 


gttatttacc 


ctgtttcggg 
Page ; 


cccccgagtt 
227 


gctggccccc 


900 



sequence 1isting_ST25.txt 

aacggcgacc tgtataacgt gtttgcctgg gccttggacg tcttggccaa acgcctccgt 960 

cccatgcacg tctttatcct ggattacgac caatcgcccg ccggctgccg ggacgccctg 1020 

ctgcaactta cctccgggat ggtccagacc cacgtcacca ccccaggctc cataccgacg 1080 

atctgcgacc tggcgcgcac gtttgcccgg gagatggggg aggctaac 1128 
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